• DocumentCode
    463397
  • Title

    Named Entity Recognition Using Hybrid Machine Learning Approach

  • Author

    Chiong, Raymond ; Wei, Wang

  • Author_Institution
    Sch. of Inf. Technol., Swinburne Univ. of Technol., Sarawak
  • Volume
    1
  • fYear
    2006
  • fDate
    17-19 July 2006
  • Firstpage
    578
  • Lastpage
    583
  • Abstract
    This paper presents a hybrid method using machine learning approach for named entity recognition (NER). A system built based on this method is able to achieve reasonable performance with minimal training data and gazetteers. The hybrid machine learning approach differs from previous machine learning-based systems in that it uses maximum entropy model (MEM) and hidden Markov model (HMM) successively. We report on the performance of our proposed NER system using British National Corpus (BNC). In the recognition process, we first use MEM to identify the named entities in the corpus by imposing some temporary tagging as references. The MEM walkthrough can be regarded as a training process for HMM, as we then use HMM for the final tagging. We show that with enough training data and appropriate error correction mechanism, this approach can achieve higher precision and recall than using a single statistical model We conclude with our experimental results that indicate the flexibility of our system in different domains
  • Keywords
    computational linguistics; hidden Markov models; learning (artificial intelligence); maximum entropy methods; British National Corpus; hidden Markov model; machine learning; maximum entropy model; named entity recognition; Entropy; Error correction; Hidden Markov models; Humans; Information technology; Knowledge based systems; Machine learning; Robustness; Tagging; Training data; Machine learning; named entity recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Informatics, 2006. ICCI 2006. 5th IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-0475-4
  • Type

    conf

  • DOI
    10.1109/COGINF.2006.365549
  • Filename
    4216466