• DocumentCode
    1910044
  • Title

    A sort Approach for Anaphora Resolution of Chinese Personal Pronoun Based on Machine Learning Method

  • Author

    Guan, Jing ; ZHOU, Yanquan ; He, Huacan

  • Author_Institution
    Beijing Univ. of Posts & Telecommun., Beijing
  • fYear
    2007
  • fDate
    Aug. 30 2007-Sept. 1 2007
  • Firstpage
    293
  • Lastpage
    300
  • Abstract
    Anaphora occurs throughout discourse or dialogue. Their high frequencies make anaphora resolution one key problem in discourse processing which attract attention of increasing researchers. In the paper, according to features of Chinese personal pronoun we present an approach which is based on corpus. It adopts maximum entropy method. Then we discovered characteristic of pronoun anaphora referring to inhuman is different form that referring to people. So the paper put forward a new approach about processing the Chinese personal pronoun anaphora. The approach divided the anaphora resolution system into two parts, the PARS subsystem and the IARS subsystem. They will separately process the personal pronoun anaphora referring to people and that referring to inhuman. The paper described the design and the realization of the system, tests the new system in the Chinese Tree Bank and evaluates the arithmetic in the round. The experiment demonstrates that the method achieves the desired result.
  • Keywords
    learning (artificial intelligence); natural language processing; Anaphora resolution; Chinese personal pronoun; machine learning method; maximum entropy method; sort approach; Data mining; Dictionaries; Feature extraction; Frequency; Helium; Learning systems; Machine learning; Natural languages; System testing; Tellurium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-1611-0
  • Electronic_ISBN
    978-1-4244-1611-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2007.4368046
  • Filename
    4368046