DocumentCode
3262207
Title
Disambiguate Chinese personal pronoun based on semantic structure
Author
Wei, Xiangfeng ; Zang, Hanfen ; Zhang, Quan
Author_Institution
Inst. of Acoust., Chinese Acad. of Sci., Beijing
fYear
2008
fDate
26-28 Aug. 2008
Firstpage
644
Lastpage
648
Abstract
It is a very difficult problem in natural language processing to resolve the ambiguity of personal pronouns anaphora in a sentence or paragraph by computer according to semantic expression. Firstly, this paper focuses on finding out the personal names based on the maximal entropy model. Secondly, it categorizes Semantic Chunks and Sentence Category(SCs) based on the HNC theory. Thirdly, we chose 40 paragraphs in 2004 Athens Olympics as training corpus to make up the personal pronouns disambiguating rules. Finally, we chose another 40 paragraphs to exam the rules and processing steps by simulating computerpsilas processing manually. We got a very high precision. Therefore, based on parallel semantic structure of HNC, the approach is effective for the disambiguating of Chinese personal pronouns.
Keywords
computational linguistics; maximum entropy methods; natural language processing; HNC theory; disambiguate Chinese personal pronoun anaphora; maximal entropy model; natural language processing; parallel semantic structure expression; semantic chunk; sentence category; Acoustics; Appraisal; Character recognition; Computational modeling; Computer simulation; Data mining; Entropy; Natural language processing; Robustness; Statistical distributions;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-4244-2512-9
Electronic_ISBN
978-1-4244-2513-6
Type
conf
DOI
10.1109/GRC.2008.4664717
Filename
4664717
Link To Document