• DocumentCode
    3248813
  • Title

    A Hybrid Oriya Named Entity Recognition System: Integrating HMM with MaxEnt

  • Author

    Biswas, Sitanath ; Mohanty, S. ; Mishra, S.P.

  • Author_Institution
    Dept. of IT, SOA Univ., Bhubaneswar, India
  • fYear
    2009
  • fDate
    16-18 Dec. 2009
  • Firstpage
    639
  • Lastpage
    643
  • Abstract
    This paper describes a hybrid system that applies maximum entropy (MaxEnt) model with hidden Markov model (HMM) and some linguistic rules to recognize name entities in Oriya language. The main advantage of our system is, we are using both HMM and MaxEnt model successively with some manually developed linguistic rules. First we are using MaxEnt to identify name entities in Oria corpus, then tagging them temporary as reference. The tagged corpus of MaxEnt now regarded as a training process for HMM. Now we use HMM for final tagging. Our approach can achieve higher precision and recall, when providing enough training data and appropriate error correction mechanism.
  • Keywords
    computational linguistics; error correction; hidden Markov models; maximum entropy methods; natural language processing; HMM; MaxEnt; Oriya language; error correction mechanism; hidden Markov model; hybrid Oriya named entity recognition system; linguistic rules; maximum entropy; Entropy; Error correction; Hidden Markov models; Morphology; Semiconductor optical amplifiers; Support vector machine classification; Support vector machines; Tagging; Time measurement; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging Trends in Engineering and Technology (ICETET), 2009 2nd International Conference on
  • Conference_Location
    Nagpur
  • Print_ISBN
    978-1-4244-5250-7
  • Electronic_ISBN
    978-0-7695-3884-6
  • Type

    conf

  • DOI
    10.1109/ICETET.2009.10
  • Filename
    5395485