Title :
Improvement on corpus-based word similarity using vector space models
Author :
Esin, Yunus Emre ; Alan, Özgür ; Alpaslan, Ferda Nur
Author_Institution :
Dept. of Comput. Eng., Middle East Tech. Univ., Ankara, Turkey
Abstract :
This paper presents a new approach for finding semantically similar words from large text collection using window based context methods. Previous studies on this problem mainly concentrate on finding new methods which are new combination of distance-weight measurement methods or new context methods. The main difference of our approach is that we focus on reprocessing of existing methods´ outputs to update the representation of related word vectors, which are used for measuring semantic distance between words, to further improve the results. This new approach can be easily applied to many of the existing word similarity methods using the vector space model for representing contexts. We claim that our method improves the performance of some of the existing similarity measuring methods.
Keywords :
text analysis; corpus-based word similarity; distance-weight measurement methods; semantic distance; vector space models; window based context methods; Computer science; Context modeling; Databases; Dictionaries; Educational institutions; History; Humans; Natural language processing; Natural languages; Thesauri;
Conference_Titel :
Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on
Conference_Location :
Guzelyurt
Print_ISBN :
978-1-4244-5021-3
Electronic_ISBN :
978-1-4244-5023-7
DOI :
10.1109/ISCIS.2009.5291827