DocumentCode
3423334
Title
Multilingual weighted codebooks
Author
Raab, Martin ; Gruhn, Rainer ; Noeth, Elmar
Author_Institution
Speech Dialog Syst., Harman Becker Automotive Syst., Ulm
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4257
Lastpage
4260
Abstract
In this paper we present an approach for speech recognition of multiple languages with constrained resources on embedded devices. Examples of such systems are navigation systems, mobile phones and MP3 players. Speech recognizers on such systems are typically to-date semi-continuous speech recognizers, which are based on vector quantization. Typical vector quantization algorithms can only generate vector quantization prototypes that are optimal for one language. We hypothesize and provide evidence that a certain fixed vector quantization is responsible for a significant drop of recognition performance when a recognizer is extended to recognize multiple languages at the same time. This paper proposes an algorithm for the construction of multilingual weighted codebooks (MWCs). These MWCs have the advantage that they offer significantly improved performance for the recognition of multiple languages.
Keywords
embedded systems; natural language processing; speech recognition; speech recognition equipment; vector quantisation; embedded devices; multilingual weighted codebooks; multiple language recognition; speech recognition; vector quantization; Acoustic testing; Automotive engineering; Merging; Natural languages; Navigation; Nearest neighbor searches; Prototypes; Speech recognition; Training data; Vector quantization; codebook; multilingual; semi-continuous;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518595
Filename
4518595
Link To Document