DocumentCode
3339845
Title
Speaker- and language-independent speech recognition in mobile communication systems
Author
Viikki, Olli ; Kiss, Imre ; Tian, Jilei
Author_Institution
Speech & Audio Syst. Lab., Nokia Res. Center, Tampere, Finland
Volume
1
fYear
2001
fDate
2001
Firstpage
5
Abstract
We investigate the technical challenges faced when moving from speaker-dependent to speaker-independent speech recognition technology in mobile communication devices. Due to globalization and the international nature of the markets and future applications, speaker independence implies the development and use of language-independent automatic speech recognition (ASR) to avoid logistical difficulties. We propose an architecture for embedded multilingual speech recognition systems. Multilingual acoustic modeling, automatic language identification, and on-line pronunciation modeling are the key features that enable the creation of truly language- and speaker-independent ASR applications with dynamic vocabularies and sparse implementation resources. Our experimental results confirm the viability of the proposed architecture. While the use of multilingual acoustic models degrades the recognition rates only marginally, a recognition accuracy decrease of approximately 4% is observed due to sub-optimal on-line text-to-phoneme mapping and automatic language identification. This performance loss can nevertheless be compensated for by applying acoustic model adaptation techniques.
Keywords
cellular radio; hidden Markov models; natural languages; speech recognition; acoustic model adaptation techniques; automatic language identification; dynamic vocabularies; language-independent speech recognition; mobile communication systems; multilingual acoustic modeling; multilingual speech recognition systems; online pronunciation modeling; recognition accuracy; sparse implementation resources; speaker-independent speech recognition; technical challenges; text-to-phoneme mapping; Acoustic applications; Acoustic devices; Automatic speech recognition; Communications technology; Globalization; Logistics; Loudspeakers; Mobile communication; Natural languages; Speech recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940753
Filename
940753