مرکز منطقه ای اطلاع رساني علوم و فناوري - FPGA-based implementation of a real-time 5000-word continuous speech recognizer

DocumentCode :

699768

Title :

FPGA-based implementation of a real-time 5000-word continuous speech recognizer

Author :

Young-kyu Choi ; Kisun You ; Wonyong Sung

Author_Institution :

Sch. of Electr. Eng., Seoul Nat. Univ., Seoul, South Korea

fYear :

2008

fDate :

25-29 Aug. 2008

Firstpage :

Lastpage :

Abstract :

We have developed a hidden Markov model based 5000-word speaker independent continuous speech recognizer using a Field-Programmable Gate Array (FPGA). The feature extraction is conducted in software on a soft-core based CPU, while the emission probability computation and the Viterbi beam search are implemented using parallel and pipelined hardware blocks. In order to reduce the bandwidth requirement to external DRAM, we employed bit-width reduction of the Gaussian parameters, multi-block computation of the emission probability, and two-stage language model pruning. These optimizations reduce the memory bandwidth requirement for emission probability computation and inter-word transition by 81% and 44%, respectively. The speech recognition hardware was synthesized for the Virtex-4 FPGA, and it operates at 100MHz. The experimental result on Wall Street Journal 5k vocabulary task shows that the developed system runs 1.52 times faster than real-time.

Keywords :

DRAM chips; Gaussian processes; feature extraction; field programmable gate arrays; hidden Markov models; optimisation; probability; speech recognition; DRAM; Gaussian parameters; Virtex-4 FPGA; Viterbi beam search; Wall Street Journal vocabulary task; emission probability computation; feature extraction; field programmable gate array; hidden Markov model; interword transition; memory bandwidth; soft-core based CPU; speaker independent continuous speech recognizer; speech recognition hardware; two-stage language model; Bandwidth; Computational modeling; Computer architecture; Hidden Markov models; Random access memory; Speech; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing Conference, 2008 16th European

Conference_Location :

Lausanne

ISSN :

2219-5491

Type :

conf

Filename :

7080300

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=699768