• DocumentCode
    2773157
  • Title

    Using Text-to-Speech Engine to Improve the Accuracy of a Speech-Enabled Interface

  • Author

    Benahmed, Y. ; Selouani, Sid-Ahmed ; Hamam, H. ; Shaughnessy, D.O.

  • Author_Institution
    Univ. de Moncton, Moncton
  • fYear
    2007
  • fDate
    18-20 Nov. 2007
  • Firstpage
    302
  • Lastpage
    306
  • Abstract
    This paper presents an automatic user profile building and training (AUPB&T) system for speech recognition. This system uses text-to-speech (TTS) voices to improve the language models and the performance of current commercial automatic speech recognition (ASR) engines. The vocabularies of these systems are usually suited for general usage. Users have no easy means of training these engines. They generally shun the proposed training methods that require long and picky training sessions. Our proposed solution is a system that accepts the user documents and favorite Web pages, and feeds them to a (TTS) module in order to improve the accuracy of spoken information retrieval queries. The results show that AUPB&T considerably improves the recognition engine performance of the Microsoft speech recognition system without having to resort to manual training.
  • Keywords
    speech recognition; user interfaces; commercial automatic speech recognition engines; speech recognition; speech-enabled interface; text-to-speech engine; Automatic speech recognition; Dictionaries; Engines; Laboratories; Natural languages; Speech recognition; Speech synthesis; Uniform resource locators; Vocabulary; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovations in Information Technology, 2007. IIT '07. 4th International Conference on
  • Conference_Location
    Dubai
  • Print_ISBN
    978-1-4244-1840-4
  • Electronic_ISBN
    978-1-4244-1841-1
  • Type

    conf

  • DOI
    10.1109/IIT.2007.4430395
  • Filename
    4430395