• DocumentCode
    2375357
  • Title

    Features induction for product named entity recognition with CRFs

  • Author

    Luo, Fang ; Fang, Pei ; Qiu, Qizhi ; Xiao, Han

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Wuhan Univ. of Technol., Wuhan, China
  • fYear
    2012
  • fDate
    23-25 May 2012
  • Firstpage
    491
  • Lastpage
    496
  • Abstract
    A framework for product named entity recognition in Chinese was presented using Conditional Random Fields with multiple features in this paper. It differentiates from most of the previous approaches mainly as follows. Firstly, introducing the domain ontology features to the CRFs model can use its semantic information. Secondly, combining internal and external features to compound features can use more rich overlapping features. so that it can improve the performance of product named entity Recognition. Experimental results show that this approach can achieve an overall F-measure around 87.16%, which seems to achieve the current state-of-the-art performance. However, due to the imperfect of Domain Ontology and the complication of reviews texts, the recognition for product named entity may not be better than the research of the traditional named entity recognition.
  • Keywords
    feature extraction; information retrieval; natural language processing; ontologies (artificial intelligence); random processes; text analysis; CRF; Chinese PNER; F-measure; conditional random fields; domain ontology features; external features; features induction; internal features; performance improvement; product named entity recognition; semantic information; text reviews; Text recognition; Viterbi algorithm; Compound Features; Conditional Random Fields; Domain Ontology; Product Named Entity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Supported Cooperative Work in Design (CSCWD), 2012 IEEE 16th International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4673-1211-0
  • Type

    conf

  • DOI
    10.1109/CSCWD.2012.6221863
  • Filename
    6221863