Automatic identification of non-anaphoric anaphora in spoken dialog

Author

Fei, Zhongchao ; Huang, Xuanjing ; Weng, Fuliang

Author_Institution

Dept. of Comput. Sci., Fudan Univ., Shanghai

fYear

2008

fDate

19-22 Oct. 2008

Firstpage

1

Lastpage

6

Abstract

Identification of non-anaphoric anaphora is an important step towards a full anaphora resolution. In this paper, we present an automatic identification approach for this task. In our work, some novel features are proposed, which are based on dependency grammars, surrounding words and their POS tags. All the features are automatically extracted using a part-of-speech (POS) tagger and a dependency parser. Our experiments are on a commonly available dialogue corpus, Trains-93. Several machine learning algorithms are used in the experiments, including CME, CRF and SVM. Results show that compared to the approaches used in the previous work, our algorithm is simpler and achieves a higher accuracy.

Keywords

computational linguistics; text analysis; anaphora resolution; automatic nonanaphoric anaphora identification; part-of-speech tagger; spoken dialog; Classification tree analysis; Computer science; Decision trees; Feature extraction; Machine learning algorithms; Pattern matching; Statistics; Support vector machine classification; Support vector machines; Spoken dialog; anaphora resolution; non-anaphoric anaphora identification;

fLanguage

English

Publisher

ieee

Conference_Titel

Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on

Conference_Location

Beijing

Print_ISBN

978-1-4244-4515-8

Electronic_ISBN

978-1-4244-2780-2

Type

conf

DOI

10.1109/NLPKE.2008.4906761

Filename

4906761