DocumentCode :
1262
Title :
Efficient Sparse Banded Acoustic Models for Speech Recognition
Author :
Weibin Zhang ; Fung, Pascale
Author_Institution :
Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Volume :
21
Issue :
3
fYear :
2014
fDate :
Mar-14
Firstpage :
280
Lastpage :
283
Abstract :
We propose sparse banded acoustic models to significantly improve the recognition accuracy and reduce the computational cost of speech recognition systems. The sparse banded models are trained using a weighted lasso regularization. In addition, we propose new feature orders to reduce the bandwidth of sparse banded models in order to speed up computation. Experimental results on the Wall Street Journal data set show that sparse banded models significantly outperform diagonal and full covariance models by 9.5% and 15.1% relatively. Sparse banded models also run the fastest. The advantages of sparse banded models are also demonstrated on the collected Cantonese data set.
Keywords :
covariance matrices; sparse matrices; speech recognition; Cantonese data set; Wall Street Journal data set; full covariance model; sparse banded acoustic model; speech recognition accuracy; weighted lasso regularization; Acoustics; Computational modeling; Covariance matrices; Data models; Feature extraction; Hidden Markov models; Sparse matrices; Inverse covariance matrix; sparse banded models; speech recognition;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2013.2292920
Filename :
6675805
Link To Document :
بازگشت