• DocumentCode
    3648915
  • Title

    Dagger: The Slovak morphological classifier

  • Author

    Daniel Hládek;Ján Staš;Jozef Juhár

  • Author_Institution
    Department of Electronics and Multimedia Communications, Technical University of Koš
  • fYear
    2012
  • Firstpage
    195
  • Lastpage
    198
  • Abstract
    This paper proposes a classifier, based on hidden Markov model that can be used for solving the problem of part-of-speech tagging of the Slavic languages, such as Slovak, Czech or Polish. These languages are highly inflectional and morphologically rich and have a very large vocabulary. The probability matrices of the classical hidden Markov model are linearly interpolated with additional probability matrices that are calculated using a suffix-based word clustering function. The search space is restricted by a morphological dictionary.
  • Keywords
    "Hidden Markov models","Probability","Tagging","Training","Dictionaries","Mathematical model","Equations"
  • Publisher
    ieee
  • Conference_Titel
    ELMAR, 2012 Proceedings
  • ISSN
    1334-2630
  • Print_ISBN
    978-1-4673-1243-1
  • Type

    conf

  • Filename
    6338504