Title :
Towards Word Sense Disambiguation of Polish
Author :
Dominik Bas;Bartosz Broda;Maciej Piasecki
Author_Institution :
Institute of Applied Informatics, Wroclaw University of Technology, Poland
Abstract :
We compare three methods of Word Sense Disambiguation (WSD) applied to a selected set of 13 Polish words. The selected words pose different problems for sense disambiguation. Because published work on WSD for Polish is hard to find, our goal was to analyse the applicability and limitations of known methods with respect to Polish and to Polish language resources and tools. The results are very positive: using limited resources, we achieved sense disambiguation accuracy greatly exceeding the most-frequent-sense baseline. For the experiments, a small corpus of representative examples was manually collected and annotated with senses drawn from plWordNet. Different representations of the context of word occurrences were also tested experimentally. Examples of the limitations and advantages of the applied methods are discussed.
Keywords :
"Natural languages","Supervised learning","Computer science","Information technology","Informatics","Testing","Automata","Mouth","Tongue","Protection"
Conference_Titel :
Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
Print_ISBN :
978-83-60810-14-9
DOI :
10.1109/IMCSIT.2008.4747220