DocumentCode :
144784
Title :
Word sense disambiguation in Nepali language
Author :
Dhungana, Udaya Raj ; Shakya, Sunny
Author_Institution :
Dept. of Electron. & Comput. Eng., Tribhuvan Univ., Lalitpur, Nepal
fYear :
2014
fDate :
6-8 May 2014
Firstpage :
46
Lastpage :
50
Abstract :
This paper presents a modified Adapted Lesk algorithm for word sense disambiguation in the Nepali language. The synset, gloss, examples and hypernym of each context word are included to form the final collection of context words. The context window contains all the words of the whole sentence except articles, prepositions and pronouns. The collection of words for each sense of the target word is likewise formed from its synset, gloss, examples and hypernym. Each word in the collection of context words is compared with every word in the collection for each sense of the target word to count the overlaps. In addition, the number of examples for each word in our sample Nepali WordNet has been increased to four on average. Experiments performed on 348 words, including 59 polysemous words, show the accuracy of our algorithm to be 88.05%.
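The overlap counting described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy English lexicon `TOY_WORDNET`, the `STOPWORDS` set, and the function names `tokens`, `sense_bag`, `context_bag` and `disambiguate` are all hypothetical stand-ins for the sample Nepali WordNet and the filtering of articles, prepositions and pronouns. Skipping the target word when building the context collection is also an assumption.

```python
from typing import Dict, List

# Hypothetical toy lexicon standing in for the sample Nepali WordNet.
# Every sense carries a synset, gloss, examples and hypernym.
TOY_WORDNET: Dict[str, List[dict]] = {
    "bank": [
        {"id": "bank.finance", "synset": ["bank", "depository"],
         "gloss": "institution that accepts money deposits",
         "examples": ["she put money in the bank"],
         "hypernym": ["institution"]},
        {"id": "bank.river", "synset": ["bank", "shore"],
         "gloss": "sloping land beside a river",
         "examples": ["they sat on the river bank"],
         "hypernym": ["slope"]},
    ],
    "money": [
        {"id": "money.currency", "synset": ["money", "currency"],
         "gloss": "medium of exchange",
         "examples": ["the bank lends money"],
         "hypernym": ["medium"]},
    ],
}

# Assumed stand-in for the excluded articles, prepositions and pronouns.
STOPWORDS = {"a", "an", "the", "in", "on", "of", "to", "she", "he", "they", "it"}


def tokens(text: str) -> List[str]:
    """Lowercase, split on whitespace and drop the stop words."""
    return [w for w in text.lower().split() if w not in STOPWORDS]


def sense_bag(sense: dict) -> List[str]:
    """Collect synset, gloss, example and hypernym words of one sense."""
    words = list(sense["synset"]) + tokens(sense["gloss"])
    for example in sense["examples"]:
        words += tokens(example)
    return words + list(sense["hypernym"])


def context_bag(sentence: str, target: str) -> List[str]:
    """Context words of the whole sentence plus their lexicon expansions."""
    bag: List[str] = []
    for word in tokens(sentence):
        if word == target:
            continue  # assumption: the target itself is not a context word
        bag.append(word)
        for sense in TOY_WORDNET.get(word, []):
            bag += sense_bag(sense)
    return bag


def disambiguate(sentence: str, target: str) -> str:
    """Return the id of the target sense with the largest overlap count."""
    context = context_bag(sentence, target)
    best_id, best_score = "", -1
    for sense in TOY_WORDNET[target]:
        candidates = sense_bag(sense)
        # Compare each context word with every word of the sense collection.
        score = sum(1 for c in context for s in candidates if c == s)
        if score > best_score:
            best_id, best_score = sense["id"], score
    return best_id


print(disambiguate("she deposited money in the bank", "bank"))
print(disambiguate("they walked beside the river bank", "bank"))
```

In this toy setting the first sentence resolves to the financial sense because "money" and its expansion overlap with that sense's collection, while the second resolves to the river sense through "beside" and "river". Ties are broken in favour of the first sense listed, which is a choice of this sketch only.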
Keywords :
natural language processing; Nepali language; context window; context word collection; context word example; context word gloss; context word hypernym; context word synset; modified adapted Lesk algorithm; polysemy words; sample Nepali WordNet; word sense disambiguation; Accuracy; Context; Dictionaries; Gold; Natural language processing; Solids; Context Word; Hypernymy; Synset; Target Word; Word Sense Disambiguation; WordNet;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Digital Information and Communication Technology and its Applications (DICTAP), 2014 Fourth International Conference on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4799-3723-3
Type :
conf
DOI :
10.1109/DICTAP.2014.6821655
Filename :
6821655