• DocumentCode
    3262207
  • Title

    Disambiguate Chinese personal pronoun based on semantic structure

  • Author

    Wei, Xiangfeng ; Zang, Hanfen ; Zhang, Quan

  • Author_Institution
    Inst. of Acoust., Chinese Acad. of Sci., Beijing
  • fYear
    2008
  • fDate
    26-28 Aug. 2008
  • Firstpage
    644
  • Lastpage
    648
  • Abstract
    It is a very difficult problem in natural language processing to resolve the ambiguity of personal pronouns anaphora in a sentence or paragraph by computer according to semantic expression. Firstly, this paper focuses on finding out the personal names based on the maximal entropy model. Secondly, it categorizes Semantic Chunks and Sentence Category(SCs) based on the HNC theory. Thirdly, we chose 40 paragraphs in 2004 Athens Olympics as training corpus to make up the personal pronouns disambiguating rules. Finally, we chose another 40 paragraphs to exam the rules and processing steps by simulating computerpsilas processing manually. We got a very high precision. Therefore, based on parallel semantic structure of HNC, the approach is effective for the disambiguating of Chinese personal pronouns.
  • Keywords
    computational linguistics; maximum entropy methods; natural language processing; HNC theory; disambiguate Chinese personal pronoun anaphora; maximal entropy model; natural language processing; parallel semantic structure expression; semantic chunk; sentence category; Acoustics; Appraisal; Character recognition; Computational modeling; Computer simulation; Data mining; Entropy; Natural language processing; Robustness; Statistical distributions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing, 2008. GrC 2008. IEEE International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4244-2512-9
  • Electronic_ISBN
    978-1-4244-2513-6
  • Type

    conf

  • DOI
    10.1109/GRC.2008.4664717
  • Filename
    4664717