DocumentCode :
538053
Title :
Semi-automatic extension of morphological lexica
Author :
Kaufmann, Tobias ; Pfister, Beat
Author_Institution :
Speech Process. Group, ETH Zurich, Zurich, Switzerland
fYear :
2010
fDate :
18-20 Oct. 2010
Firstpage :
403
Lastpage :
409
Abstract :
We present a tool that facilitates the efficient extension of morphological lexica. The tool exploits information from a morphological lexicon, a morphological grammar and a text corpus to guide the acquisition process. In particular, it employs statistical models to analyze out-of-vocabulary words and predict lexical information. These models do not require any additional labeled data for training. Furthermore, they are based on generic features that are not specific to any particular language. This paper describes the general design of the tool and evaluates the accuracy of its machine learning components.
Keywords :
computational linguistics; learning (artificial intelligence); natural language processing; statistical analysis; text analysis; machine learning; morphological grammar; morphological lexicon; statistical model; text corpus; words analysis; Accuracy; Compounds; Context; Grammar; Joints; Pragmatics; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
Conference_Location :
Wisla
ISSN :
2157-5525
Print_ISBN :
978-1-4244-6432-6
Type :
conf
DOI :
10.1109/IMCSIT.2010.5679738
Filename :
5679738
Link To Document :
بازگشت