• DocumentCode
    1632052
  • Title

    An Approach of Chunk Parsing and Entity Relation Extracting to Chinese Based on Conditional Random Fields Model

  • Author

    Wu, Jun-hua ; ZHOU, Jing

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Nanjing Univ. of Technol., Nanjing
  • Volume
    1
  • fYear
    2008
  • Firstpage
    489
  • Lastpage
    494
  • Abstract
    Conditional random fields (CRFs) model is the valid probabilistic model to segment and label sequence data. Comparing with other statistical models, such as HMM, MEHMM, CRFs process the data sequence in terms of the context of data. Chunk analysis is a shallow parsing method to simplify natural language processing. And entity relation extraction is used in establishing relationship between entities. Because full syntax parsing is complexity in Chinese text understanding chunk analysis and relation extraction is important. This paper models these problems to Chinese text. By transforming them into label solution we can use CRFs to realize the chunk analysis and entities relation extraction. In the paper we define the representation of Chinese chunk and entity relation. The features window of the label word is discussed. By training we obtain an optimized CRFs model. It can realize label to chunk and entity relation so as to complete chunk parsing and relation extracting.
  • Keywords
    natural language processing; text analysis; Chinese text understanding chunk analysis; chunk parsing; entity relation conditional random fields model; entity relation extraction; natural language processing; Data engineering; Data mining; Design engineering; Educational institutions; Hidden Markov models; Information analysis; Information science; Machine learning; Natural languages; Statistics; Chunk Parsing; Entity Relation Extraction; Information Extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2008. ISDA '08. Eighth International Conference on
  • Conference_Location
    Kaohsiung
  • Print_ISBN
    978-0-7695-3382-7
  • Type

    conf

  • DOI
    10.1109/ISDA.2008.225
  • Filename
    4696255