Title :
The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions
Author :
Rosset, Sophie ; Galibert, Olivier ; Adda, Gilles ; Bilinski, Eric
Author_Institution :
LIMSI-CNRS, Orsay
Abstract :
In this paper, we present two question-answering systems for speech transcripts. Both systems are based on a complete, multi-level analysis of queries and documents. The first system uses handcrafted rules to select small text fragments (snippets) and extract answers. The second replaces the handcrafted rules with an automatically generated research descriptor. A score based on these descriptors is used to select documents and snippets. Candidate answers are extracted and scored using proximity measurements between the research descriptor elements, together with a number of secondary factors. The preliminary results obtained on the QAst (QA on speech transcripts) development data are promising, ranging from 72% correct answers at first rank on manually transcribed meeting data to 94% on manually transcribed lecture data.
Keywords :
information retrieval; speech processing; text analysis; LIMSI QAst system; question-answering system; speech transcription; text fragment selection; Data mining; Humans; Information analysis; Information retrieval; Natural languages; Performance analysis; Search engines; Seminars; Speech processing; Speech recognition; Question answering; speech recognition of meetings and lectures
Conference_Title :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430188