DocumentCode :
3300470
Title :
Automatic identification of non-anaphoric anaphora in spoken dialog
Author :
Fei, Zhongchao ; Huang, Xuanjing ; Weng, Fuliang
Author_Institution :
Dept. of Comput. Sci., Fudan Univ., Shanghai
fYear :
2008
fDate :
19-22 Oct. 2008
Firstpage :
1
Lastpage :
6
Abstract :
Identification of non-anaphoric anaphora is an important step towards a full anaphora resolution. In this paper, we present an automatic identification approach for this task. In our work, some novel features are proposed, which are based on dependency grammars, surrounding words and their POS tags. All the features are automatically extracted using a part-of-speech (POS) tagger and a dependency parser. Our experiments are on a commonly available dialogue corpus, Trains-93. Several machine learning algorithms are used in the experiments, including CME, CRF and SVM. Results show that compared to the approaches used in the previous work, our algorithm is simpler and achieves a higher accuracy.
Keywords :
computational linguistics; text analysis; anaphora resolution; automatic nonanaphoric anaphora identification; part-of-speech tagger; spoken dialog; Classification tree analysis; Computer science; Decision trees; Feature extraction; Machine learning algorithms; Pattern matching; Statistics; Support vector machine classification; Support vector machines; Spoken dialog; anaphora resolution; non-anaphoric anaphora identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4515-8
Electronic_ISBN :
978-1-4244-2780-2
Type :
conf
DOI :
10.1109/NLPKE.2008.4906761
Filename :
4906761
Link To Document :
بازگشت