Title :
Latent Semantic Kernels for WordNet: Transforming a Tree-Like Structure into a Matrix
Author :
Kim, Young-Bum ; Kim, Yu-Seop
Author_Institution :
Dept. of Comput. Eng., Hallym Univ., Chuncheon
Abstract :
WordNet is one of the most widely used linguistic resources in the computational linguistics society. However, many applications using the WordNet hierarchical structure are suffering from the word sense disambiguation (WSD) caused by its polysemy. In order to solve the problem, we propose a matrix representing the WordNet hierarchical structure. Firstly, we transform a term as a vector with elements of each corresponding to a synset of WordNet. Then, with singular value decomposition (SVD), we reduce the dimension size of the vector to represent the latent semantic structure. For evaluation, we implement an automatic assessment system for short essays and acquire reliable accuracy. As a result, the scores which are assessed by the automatic assessment system are significantly correlated with those of human assessors. The new WordNet is expected to be easily combined with other matrix-based approaches.
Keywords :
singular value decomposition; text analysis; tree data structures; WordNet hierarchical structure; computational linguistics; latent semantic kernels; latent semantic structure; linguistic resource; matrix structure; polysemy; short essays; singular value decomposition; tree-like structure; word sense disambiguation; Application software; Computational linguistics; Humans; Information retrieval; Information technology; Kernel; Matrix decomposition; Singular value decomposition; Automatic Assessment System; Latent Semantic Kernel; Matrix; Tree; WordNet;
Conference_Titel :
Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
Conference_Location :
Dalian Liaoning
Print_ISBN :
978-0-7695-3273-8
DOI :
10.1109/ALPIT.2008.40