• DocumentCode
    3630280
  • Title

    Towards Word Sense Disambiguation of Polish

  • Author

    Dominik Bas;Bartosz Broda;Maciej Piasecki

  • Author_Institution
    Institute of Applied Informatics, Wroclaw University of Technology, Poland
  • fYear
    2008
  • Firstpage
    73
  • Lastpage
    78
  • Abstract
    We compare three different methods of Word Sense Disambiguation applied to the disambiguation of a selected set of 13 Polish words. The selected words express different problems for sense disambiguation. As it is hard to find works for Polish in this area, our goal was to analyse applicability and limitations of known methods in relation to Polish and Polish language resources and tools. The obtained results are very positive, as using limited resources, we achieved the accuracy of sense disambiguation greatly exceeding the baseline of the most frequent sense. For the needs of experiments a small corpus of representative examples was manually collected and annotated with senses drawn from plWordNet. Different representations of context of word occurrences were also experimentally tested. Examples of limitations and advantages of the applied methods are discussed.
  • Keywords
    "Natural languages","Supervised learning","Computer science","Information technology","Informatics","Testing","Automata","Mouth","Tongue","Protection"
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
  • Print_ISBN
    978-83-60810-14-9
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2008.4747220
  • Filename
    4747220