DocumentCode
144784
Title
Word sense disambiguation in Nepali language
Author
Dhungana, Udaya Raj ; Shakya, Sunny
Author_Institution
Dept. of Electron. & Comput. Eng., Tribhuvan Univ., Lalitpur, Nepal
fYear
2014
fDate
6-8 May 2014
Firstpage
46
Lastpage
50
Abstract
This paper presents a modified Adapted Lesk algorithm for word sense disambiguation in Nepali language. We included the synset, gloss, example and hypernym of the context words to form the final collection of context words. The context window contains all the words from the whole sentence except articles, prepositions and pronouns. The collection of words for each sense of a target word is also formed by including their synset, gloss, example and hypernym. Each word in the collection of context words is compared with every word in the collection of words for each sense of a target word to count the overlaps. Moreover, the numbers of examples for each word in our sample Nepali WordNet have been increased to four in average. The experiments performed on 348 words including 59 polysemy words shows the accuracy level of our algorithm to be 88.05%.
Keywords
natural language processing; Nepali language; context window; context word collection; context word example; context word gloss; context word hypernym; context word synset; modified adapted Lesk algorithm; polysemy words; sample Nepali WordNet; word sense disambiguation; Accuracy; Context; Dictionaries; Gold; Natural language processing; Solids; Context Word; Hypernymy; Synset; Target Word; Word Sense Disambiguation; WordNet;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Information and Communication Technology and it's Applications (DICTAP), 2014 Fourth International Conference on
Conference_Location
Bangkok
Print_ISBN
978-1-4799-3723-3
Type
conf
DOI
10.1109/DICTAP.2014.6821655
Filename
6821655
Link To Document