Title :
Adding intelligence to non-corpus based word sense disambiguation
Author :
Charhate, Sayali ; Dani, Asmita ; Sugandhi, R. ; Patil, Vaibhav
Author_Institution :
Cognizant Technol. Solutions India Pvt. Ltd., Pune, India
Abstract :
Natural language processing applications invariably perform word sense disambiguation as one of its processing steps. The accuracy of sense disambiguation depends upon an efficient algorithm as well as a reliable knowledge-base in the form of annotated corpus and/or dictionaries in machine readable form. Algorithms working on corpus for sense disambiguation are generally employed as supervised machine learning systems. But such systems need ample training on the corpus before being applied on the actual data set. This paper discusses an unsupervised approach of a graph-based technique that solely works on a machine-readable dictionary as the knowledge source. This approach can improve the bottleneck problem that persists in corpus-based word sense disambiguation. The method described here attempts to make the algorithm more intelligent by considering various WordNet semantic relations and auto-filtration of content words before graph generation.
Keywords :
graph theory; knowledge based systems; learning (artificial intelligence); natural language processing; WordNet semantic relations; annotated corpus; content words auto-filtration; dictionaries; graph generation; graph-based technique; knowledge-base; machine-readable dictionary; natural language processing applications; noncorpus based word sense disambiguation; supervised machine learning systems; Context; Cranes; Dictionaries; Natural language processing; Rivers; Semantics; Taxonomy; Natural Language Processing; Semantic networks; Semi-supervised Machine Learning; Similarity Measures; Text Mining; Unsupervised Machine Learning; Word Sense Disambiguation;
Conference_Titel :
Hybrid Intelligent Systems (HIS), 2012 12th International Conference on
Conference_Location :
Pune
Print_ISBN :
978-1-4673-5114-0
DOI :
10.1109/HIS.2012.6421329