• DocumentCode
    3136653
  • Title

    Robust Self-Training System for Spoken Query Information Retrieval using Pitch Range Variations

  • Author

    Benahmed, Yacine ; Selouani, Sid-Ahmed

  • Author_Institution
    LARIHS Lab., Moncton Univ., NB
  • fYear
    2006
  • fDate
    38838
  • Firstpage
    1450
  • Lastpage
    1453
  • Abstract
    This paper presents an automatic user profile building and training (AUPB&T) system using voice pitch variation for speech recognition engines. The problem with current ASR engines is that their vocabularies are usually only suited for general usage. Another problem with current ASR engines is that there is no easy means for visually challenged users to train the engine to improve its performance. Our proposed solution consists of a system that can accept a user´s document and favorite Web pages. These documents can then be parsed and their words added to the ASR engine´s lexicon. Next, it uses those documents to start an ASR training session. The training can completed automatically by using a high quality text-to-speech (TTS) natural voice. In order to overcome the problem of the limited number of high quality natural TTS voices available, we propose to integrate voice pitch variation during the training phase of AUPB&T, which can cover a broader range of user variability. The results of our experiments using standard ASR and TTS engines show that the AUPB&T system using pitch variation improved the recognition rate for an unknown beta speaker
  • Keywords
    Internet; human computer interaction; information retrieval; search engines; speech recognition; speech synthesis; Web page; automatic user profile building; pitch range variation; self-training system; speech recognition engine; spoken query information retrieval; text-to-speech synthesis; training; Automatic speech recognition; Engines; Information retrieval; Laboratories; Robustness; Speech recognition; Speech synthesis; Uniform resource locators; Vocabulary; Web pages; Human-Computer interaction; Pitch variation; Speech recognition; Text-to-Speech; robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
  • Conference_Location
    Ottawa, Ont.
  • Print_ISBN
    1-4244-0038-4
  • Electronic_ISBN
    1-4244-0038-4
  • Type

    conf

  • DOI
    10.1109/CCECE.2006.277694
  • Filename
    4054677