The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions

Author

Rosset, Sophie ; Galibert, Olivier ; Adda, Gilles ; Bilinski, Eric

Author_Institution

LIMSI-CNRS, Orsay

fYear

2007

fDate

9-13 Dec. 2007

Firstpage

647

Lastpage

652

Abstract

In this paper, we present two different question-answering systems on speech transcripts. These two systems are based on a complete and multi-level analysis of both queries and documents. The first system uses handcrafted rules for small text fragments (snippet) selection and answer extraction. The second one replaces the handcrafting with an automatically generated research descriptor. A score based on those descriptors is used to select documents and snippets. The extraction and scoring of candidate answers is based on proximity measurements within the research descriptor elements and a number of secondary factors. The preliminary results obtained on QAst (QA on speech transcripts) development data are promising ranged from 72% correct answer at 1 st rank on manually transcribed meeting data to 94% on manually transcribed lecture data.

Keywords

information retrieval; speech processing; text analysis; Limsi QAst system; question-answering system; speech transcription; text fragment selection; Data mining; Humans; Information analysis; Information retrieval; Natural languages; Performance analysis; Search engines; Seminars; Speech processing; Speech recognition; Question answering; speech recognition of meetings and lectures;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on

Conference_Location

Kyoto

Print_ISBN

978-1-4244-1746-9

Electronic_ISBN

978-1-4244-1746-9

Type

conf

DOI

10.1109/ASRU.2007.4430188

Filename

4430188