• DocumentCode
    552445
  • Title

    Chinese ner hybrid pattern based on multi-feature fusion

  • Author

    Zhang, Yue-jie ; Wu, Wei ; Jin, Cheng ; Zhang, Tao

  • Author_Institution
    Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
  • Volume
    1
  • fYear
    2011
  • fDate
    10-13 July 2011
  • Firstpage
    5
  • Lastpage
    9
  • Abstract
    This paper focuses on the Chinese Named Entity Recognition (NER) hybrid pattern, and emphasizes particularly on the fusion mechanism of multiple features for NE acquisition. It differentiates from most of previous methods mainly as that Local Features and Global Features are integrated to get higher performance. Meanwhile, to reduce search space and improve processing efficiency, Heuristic Human Knowledge is introduced into the statistical model, which could increase the performance significantly. From the experimental results on data sets of People´s Daily and NER Task in SIGHAN2008, it can be concluded that our hybrid model based on multi-feature fusion is an effective NER pattern to combine statistical model and heuristic human knowledge.
  • Keywords
    information retrieval; natural language processing; sensor fusion; statistical analysis; Chinese NER hybrid pattern; NE acquisition; global features; heuristic human knowledge; local features; multifeature fusion mechanism; named entity recognition; natural language processing; processing efficiency improvement; search space reduction; statistical model; Data models; Educational institutions; Feature extraction; Humans; Machine learning; Reliability; Training; Conditional Random Field (CRF); Maximum Entropy (ME); Named Entity Recognition (NER); multi-feature fusion; reliability evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
  • Conference_Location
    Guilin
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4577-0305-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2011.6016675
  • Filename
    6016675