DocumentCode :
2297986
Title :
Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP
Author :
Gong, Yfan ; Kao, Yu-Hung
Author_Institution :
Texas Instrum. Inc., Dallas, TX, USA
Volume :
6
fYear :
2000
fDate :
2000
Firstpage :
3686
Abstract :
Continuous speech recognition is a resource-intensive algorithm. Commercial dictation software requires more than 10 Mbytes to install on the disk and 32 Mbytes RAM to run the application. A typical embedded system can not afford this much RAM because of its high cost and power consumption; it also lacks disk to store the large amount of static data (e.g. acoustic models). We have been working on optimization of a small vocabulary speech recognizer suitable for implementation on a 16-bit fixed-point DSP. This recognizer supports sophisticated continuous density, tied-mixtures Gaussians, parallel model combination, and a noise-robust utterance detection algorithm. The fixed-point version achieves the same performance as the floating-point version. The algorithm runs real-time on a 100 MHz, 16-bit, fixed-point Texas Instruments TMS320C5410 even for the most challenging continuous digit dialing with hands-free microphone in driving conditions
Keywords :
digital signal processing chips; fixed point arithmetic; microphones; random-access storage; signal detection; speech recognition; telephone sets; 100 MHz; 16 bit; RAM; Texas Instruments TMS320C5410; acoustic models; commercial dictation software; continuous density; continuous digit dialing; disk; driving conditions; embedded system; fixed-point DSP; hands-free microphone; high accuracy speech recognizer; noise-robust utterance detection algorithm; optimization; parallel model combination; performance; real-time algorithm; resource-intensive algorithm; small vocabulary speech recognizer; software design; speaker-independent continuous speech recognizer; tied-mixtures Gaussians; Application software; Costs; Digital signal processing; Embedded system; Energy consumption; Gaussian processes; Noise robustness; Power system modeling; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.860202
Filename :
860202
Link To Document :
بازگشت