Title :
Resources for Nepali Word Sense Disambiguation
Author :
Shrestha, Niraj ; Hall, Patrick A V ; Bista, Sanat K.
Author_Institution :
Inf. & Language, Process. Res. Lab., Kathmandu Univ., Kathmandu
Abstract :
Word sense disambiguation (WSD) is a process of identifying proper meaning of words that may have multiple meanings. It is regarded as one of the most challenging problems in the field of natural language processing (NLP). Nepali Language also has words that have multiple meanings, thus giving rise to the problem of WSD in it. In this paper, we investigate the impact of NLP resources like morphology analyzer (MA) and machine readable dictionary (MRD) in ambiguity resolution. Our results show that the accuracy in WSD is better with the availability of NLP resources like morph analyzer, MRD etc. Lesk algorithm has been used to solve WSD problem using a sample Nepali WordNet containing few sets of Nepali nouns and the system is able to disambiguate these nouns only. The system was tested on a small set of data with limited number of nouns. The accuracy reading was between 50% - 70% depending on the sample data provided. When the same data was tested through manual morph analysis, the accuracy was seen to be considerably high (80%).
Keywords :
dictionaries; natural language processing; word processing; Lesk algorithm; Nepali word sense disambiguation; machine readable dictionary; morphology analyzer; natural language processing; Availability; Computer science; Dictionaries; Information retrieval; Morphology; Natural language processing; Natural languages; Software systems; Speech processing; System testing; Language; Lesk Algorithm; Nepali WordNet; WSD;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4515-8
Electronic_ISBN :
978-1-4244-2780-2
DOI :
10.1109/NLPKE.2008.4906758