DocumentCode
3630280
Title
Towards Word Sense Disambiguation of Polish
Author
Dominik Bas;Bartosz Broda;Maciej Piasecki
Author_Institution
Institute of Applied Informatics, Wroclaw University of Technology, Poland
fYear
2008
Firstpage
73
Lastpage
78
Abstract
We compare three different methods of Word Sense Disambiguation applied to the disambiguation of a selected set of 13 Polish words. The selected words express different problems for sense disambiguation. As it is hard to find works for Polish in this area, our goal was to analyse applicability and limitations of known methods in relation to Polish and Polish language resources and tools. The obtained results are very positive, as using limited resources, we achieved the accuracy of sense disambiguation greatly exceeding the baseline of the most frequent sense. For the needs of experiments a small corpus of representative examples was manually collected and annotated with senses drawn from plWordNet. Different representations of context of word occurrences were also experimentally tested. Examples of limitations and advantages of the applied methods are discussed.
Keywords
"Natural languages","Supervised learning","Computer science","Information technology","Informatics","Testing","Automata","Mouth","Tongue","Protection"
Publisher
ieee
Conference_Titel
Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
Print_ISBN
978-83-60810-14-9
Type
conf
DOI
10.1109/IMCSIT.2008.4747220
Filename
4747220
Link To Document