DocumentCode
3334223
Title
Question Classification in English-Chinese Cross-Language Question Answering: An Integrated Genetic Algorithm and Machine Learning Approach
Author
Day, Min-Yuh ; Ong, Chorng-Shyong ; Hsu, Wen-Lian
Author_Institution
Acad. Sinica, Taipei
fYear
2007
fDate
13-15 Aug. 2007
Firstpage
203
Lastpage
208
Abstract
Question classification plays an important role in cross-language question answering (CLQA) systems, while question Informer plays a key role in enhancing question classification for factual question answering. In this paper, we propose an integrated genetic algorithm (GA) and machine learning (ML) approach for question classification in English-Chinese cross-language question answering. To enhance question informer prediction, we use a hybrid method that integrates GA and conditional random fields (CRF) to optimize feature subset selection in a CRF-based question informer prediction model. The proposed approach extends cross-language question classification by using the GA-CRF question informer feature with support vector machines (SVM). The results of evaluations on the NTCIR-6 CLQA question sets demonstrate the efficacy of the approach in improving the accuracy of question classification in English-Chinese cross-language question answering.
Keywords
classification; genetic algorithms; learning (artificial intelligence); support vector machines; English-Chinese cross-language question answering; SVM; conditional random fields; cross-language question answering systems; factual question answering; genetic algorithm; machine learning; question classification; support vector machines; Cities and towns; Genetic algorithms; Information management; Information science; Machine learning; Natural languages; Optimization methods; Predictive models; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Conference_Location
Las Vegas, IL
Print_ISBN
1-4244-1500-4
Electronic_ISBN
1-4244-1500-4
Type
conf
DOI
10.1109/IRI.2007.4296621
Filename
4296621
Link To Document