Title :
Efficient Sparse Banded Acoustic Models for Speech Recognition
Author :
Weibin Zhang ; Fung, Pascale
Author_Institution :
Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Abstract :
We propose sparse banded acoustic models to significantly improve the recognition accuracy and reduce the computational cost of speech recognition systems. The sparse banded models are trained using a weighted lasso regularization. In addition, we propose new feature orders to reduce the bandwidth of sparse banded models in order to speed up computation. Experimental results on the Wall Street Journal data set show that sparse banded models significantly outperform diagonal and full covariance models by 9.5% and 15.1% relatively. Sparse banded models also run the fastest. The advantages of sparse banded models are also demonstrated on the collected Cantonese data set.
Keywords :
covariance matrices; sparse matrices; speech recognition; Cantonese data set; Wall Street Journal data set; full covariance model; sparse banded acoustic model; speech recognition accuracy; weighted lasso regularization; Acoustics; Computational modeling; Covariance matrices; Data models; Feature extraction; Hidden Markov models; Sparse matrices; Inverse covariance matrix; sparse banded models; speech recognition;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2013.2292920