DocumentCode
3648915
Title
Dagger: The Slovak morphological classifier
Author
Daniel Hládek;Ján Staš;Jozef Juhár
Author_Institution
Department of Electronics and Multimedia Communications, Technical University of Koš
fYear
2012
Firstpage
195
Lastpage
198
Abstract
This paper proposes a classifier based on a hidden Markov model that can be used for part-of-speech tagging of Slavic languages such as Slovak, Czech, or Polish. These languages are highly inflectional and morphologically rich, and have very large vocabularies. The probability matrices of the classical hidden Markov model are linearly interpolated with additional probability matrices calculated using a suffix-based word clustering function. The search space is restricted by a morphological dictionary.
Keywords
"Hidden Markov models","Probability","Tagging","Training","Dictionaries","Mathematical model","Equations"
Publisher
ieee
Conference_Titel
ELMAR, 2012 Proceedings
ISSN
1334-2630
Print_ISBN
978-1-4673-1243-1
Type
conf
Filename
6338504