• DocumentCode
    2253104
  • Title

    A method of Chinese coreference resolution combined multi-features in discourse

  • Author

    Shi, Shu-min ; Huang, He-yan ; Chen, Rui-yang

  • Author_Institution
    Sch. of Comput. Sci. & Techonolgy, Beijing Inst. of Technol., Beijing, China
  • Volume
    3
  • fYear
    2010
  • fDate
    11-14 July 2010
  • Firstpage
    1311
  • Lastpage
    1316
  • Abstract
    Coreference that is a kind of ubiquitous language phenomenon makes the topic more highlighted and the narration more concise and coherent in discourse. Conversely, it leads to ambiguity in Natural Language Processing as well. Coreference resolution is the process that eliminates the indeterminacy caused by coreferential forms. To improve the current system, a method of coreference resolution combined with multi-features, mainly including clause´s or full-sentence´s distance, semantic class, and shorten-form features, is proposed in this paper. Experiments show that those features are valuable and have certain effects on the performance of resolution. It can be verified by both precision and F-measure in Chinese-oriented text discourse. In addition, a novelty point absorbed somewhat domain ontological idea in our previous work and embodied further in this paper is rather than usual manual linguistic rules-based approaches for processing the resolution.
  • Keywords
    natural language processing; text analysis; Chinese coreference resolution; Chinese-oriented text discourse; F-measure; coreferential forms; linguistic rules-based approaches; natural language processing; ubiquitous language phenomenon; Chromium; Decision trees; Machine learning; Semantics; Support vector machine classification; Training; CRF; Coreference resolution; Decision tree; Multi-features; NER;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-1-4244-6526-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2010.5580883
  • Filename
    5580883