Title :
A 40-nm 144-mW VLSI processor for real-time 60-kWord continuous speech recognition
Author :
He, Guangji ; Sugahara, Tohru ; Fujinaga, T. ; Miyamoto, Yutaka ; Noguchi, Hiroki ; Izumi, Shintaro ; Kawaguchi, Hitoshi ; Yoshimoto, Masahiko
Author_Institution :
Kobe Univ., Kobe, Japan
Abstract :
We have developed a low-power VLSI chip for 60-kWord real-time continuous speech recognition based on a context-dependent Hidden Markov Model (HMM). Our implementation includes a cache architecture that exploits the locality of speech recognition, beam pruning with a dynamic threshold, two-stage language model searching, highly parallel Gaussian Mixture Model (GMM) computation at the mixture level, a variable-frame look-ahead scheme, and elastic pipeline operation between the Viterbi transition and GMM processing. Results show that our implementation reduces the required memory bandwidth by 95% (to 70.86 MB/s) and the required operating frequency by 78% (to 126.5 MHz). The test chip, fabricated in 40-nm CMOS technology, contains 1.9 M transistors for logic and 7.8 Mbit of on-chip memory. It dissipates 144 mW at 126.5 MHz and 1.1 V for 60-kWord real-time continuous speech recognition.
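The reported figures can be cross-checked with simple arithmetic. The sketch below assumes that the values in parentheses (70.86 MB/s and 126.5 MHz) are the requirements after the reductions, which is consistent with the chip operating at 126.5 MHz; the helper name `implied_baseline` and the derived quantities are illustrative and do not appear in the abstract.

```python
def implied_baseline(after: float, reduction: float) -> float:
    """Return the value before a fractional reduction, given the value after it."""
    return after / (1.0 - reduction)


# Figures taken from the abstract.
bandwidth_after_mb_s = 70.86   # required bandwidth after a 95% reduction
freq_after_mhz = 126.5         # required frequency after a 78% reduction
power_mw = 144.0               # dissipation at 126.5 MHz and 1.1 V

# Derived values (assumption: the percentages are relative to an unoptimized baseline).
baseline_bw = implied_baseline(bandwidth_after_mb_s, 0.95)
baseline_freq = implied_baseline(freq_after_mhz, 0.78)

# mW / MHz = (1e-3 J/s) / (1e6 cycles/s) = 1e-9 J per cycle, i.e. nJ per cycle.
energy_per_cycle_nj = power_mw / freq_after_mhz

print(f"implied baseline bandwidth: {baseline_bw:.1f} MB/s")
print(f"implied baseline frequency: {baseline_freq:.1f} MHz")
print(f"average energy per clock cycle: {energy_per_cycle_nj:.3f} nJ")
```

Under that assumption, the unoptimized design would have needed roughly 1417 MB/s of bandwidth and a clock of about 575 MHz, and the fabricated chip spends a little over 1.1 nJ per clock cycle on average.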
Keywords :
CMOS integrated circuits; VLSI; hidden Markov models; real-time systems; speech recognition; 60 kWord real-time continuous speech recognition; CMOS technology; GMM processing; HMM; VLSI processor; Viterbi transition; bandwidth reduction; beam pruning; cache architecture; context-dependent hidden Markov model; dynamic threshold; elastic pipeline operation; highly parallel Gaussian mixture model; low-power VLSI chip; on-chip memory; power 144 mW; real-time 60-kWord continuous speech recognition; size 40 nm; two-stage language model searching; variable-frame look-ahead scheme; Frequency measurement; Power demand; Random access memory; Viterbi algorithm
Conference_Title :
2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)
Conference_Location :
Yokohama
Print_ISBN :
978-1-4673-3029-9
DOI :
10.1109/ASPDAC.2013.6509561