Title :
Distance metric learning for kernel density-based acoustic model under limited training data conditions
Author :
Van Hai Do;Xiong Xiao;Eng Siong Chng;Haizhou Li
Author_Institution :
School of Computer Engineering, Nanyang Technological University, Singapore
Abstract :
Kernel density model works well for limited training data in acoustic modeling. In this paper, we improve the kernel density-based acoustic model for low resource language speech recognition. In our previous study, we demonstrated the effectiveness of the kernel density-based acoustic model on discriminative features such as cross-lingual bottleneck features. In this paper, we propose to learn a Mahalanobis-based distance, which is equivalent to a full rank linear feature transformation, to minimize training data frame classification error. Experimental results on the Wall Street Journal (WSJ) task show that the proposed Mahalanobis-based distance learning results in significant improvements over the Euclidean distance. The kernel density acoustic model with the Mahalanobis-based distance also outperforms deep neural network acoustic model significantly in limited training data cases.
Keywords :
"Hidden Markov models","Kernel","Training data","Acoustics","Measurement","Data models","Training"
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
DOI :
10.1109/APSIPA.2015.7415373