Title :
FPGA-based implementation of a real-time 5000-word continuous speech recognizer
Author :
Young-kyu Choi ; Kisun You ; Wonyong Sung
Author_Institution :
Sch. of Electr. Eng., Seoul Nat. Univ., Seoul, South Korea
Abstract :
We have developed a hidden Markov model based 5000-word speaker independent continuous speech recognizer using a Field-Programmable Gate Array (FPGA). The feature extraction is conducted in software on a soft-core based CPU, while the emission probability computation and the Viterbi beam search are implemented using parallel and pipelined hardware blocks. In order to reduce the bandwidth requirement to external DRAM, we employed bit-width reduction of the Gaussian parameters, multi-block computation of the emission probability, and two-stage language model pruning. These optimizations reduce the memory bandwidth requirement for emission probability computation and inter-word transition by 81% and 44%, respectively. The speech recognition hardware was synthesized for the Virtex-4 FPGA, and it operates at 100MHz. The experimental result on Wall Street Journal 5k vocabulary task shows that the developed system runs 1.52 times faster than real-time.
Keywords :
DRAM chips; Gaussian processes; feature extraction; field programmable gate arrays; hidden Markov models; optimisation; probability; speech recognition; DRAM; Gaussian parameters; Virtex-4 FPGA; Viterbi beam search; Wall Street Journal vocabulary task; emission probability computation; feature extraction; field programmable gate array; hidden Markov model; interword transition; memory bandwidth; soft-core based CPU; speaker independent continuous speech recognizer; speech recognition hardware; two-stage language model; Bandwidth; Computational modeling; Computer architecture; Hidden Markov models; Random access memory; Speech; Speech recognition;
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne