DocumentCode
2283225
Title
Word sense disambiguation in Mongolian language
Author
Bataa, Batzolboo ; Altangerel, Khuder
Author_Institution
Department of Software Engineering, School of Computer Science and Management, Mongolian University of Science and Technology, Ulaanbaatar Mongolia
fYear
2012
fDate
18-21 Sept. 2012
Firstpage
1
Lastpage
4
Abstract
Word sense disambiguation is an important intermediate stage for many natural language processing applications, especially transformation from Cyrillic into Mongolian script. A word sense could be disambiguated by other words in the context as nouns, verbs used with the word. In this research, we have analyzed the result of an experiment on a word disambiguation system for Mongolian language based on statistical model, to which one sense per collocation algorithm is applied, and suggested a model established of the weight of sense rate and the weight of distance to the adjacent words to improve the accuracy. We chose one of 1.7 thousand words which have more than one sense that were used in the experiment, and performed an experiment on 41 thousand words. We researched one of 6.3 thousand words which have a complex stem that can be considered to have another stem plus a suffix structure. Since it is the first work in this field for Mongolian language no previous work results were available for comparison.
Keywords
Bayes methods; classification; natural language processing; text analysis; Bayesian classifier; Cyrillic script; Mongolian language; Mongolian script; natural language processing application; noun; one sense per collocation algorithm; sense rate; statistical model; suffix structure; verb; word sense disambiguation; Accuracy; Context; Dictionaries; Educational institutions; Natural language processing; Training; Bayesian classifier; decision list; one sense per collocation; word sense disambiguation;
fLanguage
English
Publisher
ieee
Conference_Titel
Strategic Technology (IFOST), 2012 7th International Forum on
Conference_Location
Tomsk
Print_ISBN
978-1-4673-1772-6
Type
conf
DOI
10.1109/IFOST.2012.6357625
Filename
6357625
Link To Document