Title :
A Hybrid Oriya Named Entity Recognition System: Integrating HMM with MaxEnt
Author :
Biswas, Sitanath ; Mohanty, S. ; Mishra, S.P.
Author_Institution :
Dept. of IT, SOA Univ., Bhubaneswar, India
Abstract :
This paper describes a hybrid system that applies maximum entropy (MaxEnt) model with hidden Markov model (HMM) and some linguistic rules to recognize name entities in Oriya language. The main advantage of our system is, we are using both HMM and MaxEnt model successively with some manually developed linguistic rules. First we are using MaxEnt to identify name entities in Oria corpus, then tagging them temporary as reference. The tagged corpus of MaxEnt now regarded as a training process for HMM. Now we use HMM for final tagging. Our approach can achieve higher precision and recall, when providing enough training data and appropriate error correction mechanism.
Keywords :
computational linguistics; error correction; hidden Markov models; maximum entropy methods; natural language processing; HMM; MaxEnt; Oriya language; error correction mechanism; hidden Markov model; hybrid Oriya named entity recognition system; linguistic rules; maximum entropy; Entropy; Error correction; Hidden Markov models; Morphology; Semiconductor optical amplifiers; Support vector machine classification; Support vector machines; Tagging; Time measurement; Training data;
Conference_Titel :
Emerging Trends in Engineering and Technology (ICETET), 2009 2nd International Conference on
Conference_Location :
Nagpur
Print_ISBN :
978-1-4244-5250-7
Electronic_ISBN :
978-0-7695-3884-6
DOI :
10.1109/ICETET.2009.10