• DocumentCode
    3339845
  • Title

    Speaker- and language-independent speech recognition in mobile communication systems

  • Author

    Viikki, Olli ; Kiss, Imre ; Tian, Jilei

  • Author_Institution
    Speech & Audio Syst. Lab., Nokia Res. Center, Tampere, Finland
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    5
  • Abstract
    We investigate the technical challenges that are faced when making a transition from the speaker-dependent to speaker-independent speech recognition technology in mobile communication devices. Due to globalization as well as the international nature of the markets and the future applications, speaker independence implies the development and use of language-independent automatic speech recognition (ASR) to avoid logistic difficulties. We propose an architecture for embedded multilingual speech recognition systems. Multilingual acoustic modeling, automatic language identification, and on-line pronunciation modeling are the key features which enable the creation of truly language- and speaker-independent ASR applications with dynamic vocabularies and sparse implementation resources. Our experimental results confirm the viability of the proposed architecture. While the use of multilingual acoustic models degrades the recognition rates only marginally, a recognition accuracy decrease of approximately 4% is observed due to sub-optimal on-line text-to-phoneme mapping and automatic language identification. This performance loss can nevertheless be compensated by applying acoustic model adaptation techniques
  • Keywords
    cellular radio; hidden Markov models; natural languages; speech recognition; acoustic model adaptation techniques; automatic language identification; dynamic vocabularies; language-independent speech recognition; mobile communication systems; multilingual acoustic modeling; multilingual speech recognition systems; online pronunciation modeling; recognition accuracy; sparse implementation resources; speaker-independent speech recognition; technical challenges; text-to-phoneme mapping; Acoustic applications; Acoustic devices; Automatic speech recognition; Communications technology; Globalization; Logistics; Loudspeakers; Mobile communication; Natural languages; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940753
  • Filename
    940753