DocumentCode
535046
Title
A multi-stream speech recognition system based on the estimation of stream weights
Author
Guo, Hongyu ; Chen, Qinghai ; Huang, Dongmei ; Guo, Hongyu ; Zhao, Xiaoqun
Author_Institution
Sch. of Inf., Shanghai Ocean Univ., Shanghai, China
Volume
7
fYear
2010
fDate
16-18 Oct. 2010
Firstpage
3479
Lastpage
3482
Abstract
A multi-stream speech recognition framework based on the estimation of stream weights is proposed for robust speech recognition. First, two complementary acoustic features, MFCCs and LPCCs, were selected. Second, we modeled them separately by using Hidden Markov Models (HMMs), furthermore, formed two streams of this system. Last, we combined the likelihood outputs of the above two systems with weighting technique and obtained a better performance. Here we present a novel algorithm for computing the stream weights of the two feature streams based on the computation of intra-and inter-class distances. Experimental results obtained on Chinese Academy of Science speech database show that this system yields better recognition performance in all conditions. Using this multi-stream framework, we found that the word error rate was decreased by 5%.
Keywords
hidden Markov models; speech recognition; complementary acoustic features; hidden Markov models; multi-stream speech recognition system; speech database; stream weights estimation; Computational modeling; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; hidden markov models; linear predictive cepstral coefficient; mel-frequency cepstral coefficient; multi-stream framework; speech recognition; stream weights;
fLanguage
English
Publisher
ieee
Conference_Titel
Image and Signal Processing (CISP), 2010 3rd International Congress on
Conference_Location
Yantai
Print_ISBN
978-1-4244-6513-2
Type
conf
DOI
10.1109/CISP.2010.5646753
Filename
5646753
Link To Document