DocumentCode :
2424337
Title :
Using SIMD technology to speed up likelihood computation in HMM-based speech recognition systems
Author :
Ou, Jianlin ; Cai, Jun ; Lin, Qian
Author_Institution :
Dept. of Cognitive Sci., Xiamen Univ., Xiamen
fYear :
2008
fDate :
7-9 July 2008
Firstpage :
123
Lastpage :
127
Abstract :
Most state-of-the-art LVCSR systems are based on continuous density HMMs, which are typically implemented using Gaussian mixture distributions. Such statistical modeling systems usually operate slower than real-time, largely because of the heavy computational overhead of the likelihood computation. The objective of our research is to investigate application of modern SIMD technology to speed up the likelihood computation without degrading the recognition accuracy. In this paper, the likelihood computation of continuous density HMMs is analyzed to show that the conventional way of sequential computing is time-consuming and the likelihood computation itself can be implemented in parallel. A SIMD-based algorithm which can carry out parallel likelihood computation is presented in this paper. Likelihood computation modules in HTK3.4 toolkit have been modified with SIMD instructions to implement this algorithm. Experiments on TIMIT and WSJ0 corpora show that the SIMD-based data-level parallelism can significantly reduce the time overhead for likelihood computation.
Keywords :
Gaussian distribution; hidden Markov models; maximum likelihood estimation; parallel processing; speech recognition; vocabulary; Gaussian mixture distributions; HTK3.4 toolkit; SIMD technology; TIMIT; WSJ0 corpora; continuous density hidden Markov model; data-level parallelism; large vocabulary continuous speech recognition system; parallel likelihood computation; sequential computing; statistical modeling systems; Computer aided instruction; Computer architecture; Concurrent computing; Degradation; Hidden Markov models; Parallel processing; Real time systems; Registers; Speech recognition; Streaming media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1723-0
Electronic_ISBN :
978-1-4244-1724-7
Type :
conf
DOI :
10.1109/ICALIP.2008.4590086
Filename :
4590086
Link To Document :
بازگشت