Title :
A Hybrid Model for Computational Morphology Application
Author :
Yang, Xu ; Hou-Feng, Wang
Author_Institution :
Peking Univ., Beijing
fDate :
July 30 2007-Aug. 1 2007
Abstract :
Computational morphology is a core component in many different types of natural language processing, such as the alignment techniques. This paper describes a method for morphological processing. Based on both rules and statistical models, a lemmatizer is constructed to analyze the English inflectional morphology, and automatically derives the lemmas of the words. The rule model incorporates data from various corpora, machine-readable dictionaries, and an empirical metamorphose rule set, and the statistical model applies mainly the maximum entropy principles to deal with unknown words and ambiguous cases effectively. The knowledge used in our lemmatizer is convenient to update to support the development of natural language processing. Experiments show that the lemmatizer has a wide coverage and high accuracy.
Keywords :
computational linguistics; maximum entropy methods; natural language processing; English inflectional morphology; alignment techniques; computational morphology application; hybrid model; language lemmatizer; maximum entropy principles; natural language processing; rule model; statistical model; word lemmas; Artificial intelligence; Computational modeling; Computer applications; Computer networks; Dictionaries; Distributed computing; Entropy; Morphology; Natural language processing; Software engineering;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-2909-7
DOI :
10.1109/SNPD.2007.34