DocumentCode
2770404
Title
The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions
Author
Rosset, Sophie ; Galibert, Olivier ; Adda, Gilles ; Bilinski, Eric
Author_Institution
LIMSI-CNRS, Orsay
fYear
2007
fDate
9-13 Dec. 2007
Firstpage
647
Lastpage
652
Abstract
In this paper, we present two different question-answering systems on speech transcripts. These two systems are based on a complete and multi-level analysis of both queries and documents. The first system uses handcrafted rules for small text fragments (snippet) selection and answer extraction. The second one replaces the handcrafting with an automatically generated research descriptor. A score based on those descriptors is used to select documents and snippets. The extraction and scoring of candidate answers is based on proximity measurements within the research descriptor elements and a number of secondary factors. The preliminary results obtained on QAst (QA on speech transcripts) development data are promising ranged from 72% correct answer at 1 st rank on manually transcribed meeting data to 94% on manually transcribed lecture data.
Keywords
information retrieval; speech processing; text analysis; Limsi QAst system; question-answering system; speech transcription; text fragment selection; Data mining; Humans; Information analysis; Information retrieval; Natural languages; Performance analysis; Search engines; Seminars; Speech processing; Speech recognition; Question answering; speech recognition of meetings and lectures;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location
Kyoto
Print_ISBN
978-1-4244-1746-9
Electronic_ISBN
978-1-4244-1746-9
Type
conf
DOI
10.1109/ASRU.2007.4430188
Filename
4430188
Link To Document