• DocumentCode
    2770351
  • Title

    A system for speech driven information retrieval

  • Author

    González-Ferreras, César ; Cardeñoso-Payo, Valentín

  • Author_Institution
    Univ. de Valladolid, Valladolid
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    624
  • Lastpage
    628
  • Abstract
    In this paper we present a system that allows users to search information in a document collection using a spoken query. The system is based on a speech recognizer and on an information retrieval engine. The system works for Spanish language. We evaluated the system using CLEF´01 test set, extended to include spoken queries. We proposed an adaptation of vocabulary and language model, to reduce the out of vocabulary word problem. In order to reduce errors caused by words in a foreign language, we expanded our pronunciation lexicon to include the pronunciation of English words. Experiments showed a relative gain in retrieval precision of 6.34%, a relative reduction in OOV word rate of 24.71% and a relative reduction in WER of 10.87%.
  • Keywords
    information retrieval systems; natural languages; speech recognition; Spanish language; information retrieval engine; language model; speech driven information retrieval; speech recognizer; spoken query; vocabulary model; Adaptation model; Audio recording; Information retrieval; Internet; Microcomputers; Natural languages; Search engines; Speech recognition; System testing; Vocabulary; foreign words modeling; information retrieval; language model adaptation; speech driven information retrieval; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430184
  • Filename
    4430184