• DocumentCode
    2770404
  • Title

    The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions

  • Author

    Rosset, Sophie ; Galibert, Olivier ; Adda, Gilles ; Bilinski, Eric

  • Author_Institution
    LIMSI-CNRS, Orsay
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    647
  • Lastpage
    652
  • Abstract
    In this paper, we present two different question-answering systems on speech transcripts. These two systems are based on a complete and multi-level analysis of both queries and documents. The first system uses handcrafted rules for small text fragments (snippet) selection and answer extraction. The second one replaces the handcrafting with an automatically generated research descriptor. A score based on those descriptors is used to select documents and snippets. The extraction and scoring of candidate answers is based on proximity measurements within the research descriptor elements and a number of secondary factors. The preliminary results obtained on QAst (QA on speech transcripts) development data are promising ranged from 72% correct answer at 1 st rank on manually transcribed meeting data to 94% on manually transcribed lecture data.
  • Keywords
    information retrieval; speech processing; text analysis; Limsi QAst system; question-answering system; speech transcription; text fragment selection; Data mining; Humans; Information analysis; Information retrieval; Natural languages; Performance analysis; Search engines; Seminars; Speech processing; Speech recognition; Question answering; speech recognition of meetings and lectures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430188
  • Filename
    4430188