DocumentCode :
2296031
Title :
A low cost dynamic vocabulary speech recognizer on a GPP-DSP system
Author :
Kao, Yu-Hung ; Rajasekaran, P.K.
Author_Institution :
Texas Instrum., USA
Volume :
6
fYear :
2000
fDate :
2000
Firstpage :
3215
Abstract :
Continuous speech recognition is a resource-intensive algorithm. Commercial dictation software requires more than 10 Mbytes to install on the disk and 32 Mbytes RAM to run the application. Because of the resource requirement, such a system can not be implemented in a low cost and low power embedded system. We propose a design of dynamic vocabulary speech recognizer that will fit in a DSP-GPP (general purpose processor) architecture. The computation intensive, small footprint recognizer engine runs on the DSP; and the computation non-intensive, larger footprint grammar, dictionary, and model acoustic components resides on the GPP. The recognition models are prepared on the GPP and transferred to the DSP, the interaction among the application, model generation, and recognition modules is minimal. The result is a speech recognition server implemented in a low cost embedded system. The application can dynamically create flexible vocabulary to suit different recognition contexts. It still does not do large vocabulary dictation; however, it provides unlimited recognition contexts with unlimited vocabulary, all these implementable in a low cost embedded system
Keywords :
digital signal processing chips; embedded systems; general purpose computers; speech recognition; GPP-DSP system; RAM; application module; commercial dictation software; continuous speech recognition; dictionary; general purpose processor; grammar; low cost dynamic vocabulary speech recognizer; low cost embedded system; model acoustic components; model generation; recognition contexts; recognition models; recognition module; small footprint recognizer engine; speech recognition server; Application software; Computer architecture; Costs; Dictionaries; Digital signal processing; Embedded system; Engines; Speech processing; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.860084
Filename :
860084
Link To Document :
بازگشت