DocumentCode :
1653807
Title :
Selecting Answers to Questions from Web Documents by a Robust Validation Process
Author :
Grappy, A. ; Grau, B. ; Falco, M.-H. ; Ligozat, A.-L. ; Robba, I. ; Vilnat, A.
Author_Institution :
LIMSI, Univ. Paris-Sud, Orsay, France
Volume :
1
fYear :
2011
Firstpage :
55
Lastpage :
62
Abstract :
Question answering (QA) systems aim at finding answers to question posed in natural language using a collection of documents. When the collection is extracted from the Web, the structure and style of the texts are quite different from those of newspaper articles. We developed a QA system based on an answer validation process able to handle Web specificity. A large number of candidate answers are extracted from short passages in order to be validated according to question and passages characteristics. The validation module is based on a machine learning approach. It takes into account criteria characterizing both passage and answer relevance at surface, lexical, syntactic and semantic levels to deal with different types of texts. We present and compare results obtained for factual questions posed on a Web and on a newspaper collection. We show that our system outperforms a baseline by up to 48% in MRR.
Keywords :
Internet; feature extraction; learning (artificial intelligence); natural language processing; program verification; question answering (information retrieval); text analysis; QA system; Web document; answer selection; document collection; machine learning; natural language; newspaper article; question answering system; robust validation process; semantic level; Data mining; Feature extraction; HTML; Reliability; Semantics; Springs; Syntactics; Web document analysis; answer validation; fine-grained information retrieval; question-answering system;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
Type :
conf
DOI :
10.1109/WI-IAT.2011.210
Filename :
6040496
Link To Document :
بازگشت