DocumentCode :
1135508
Title :
Subband Correlation and Robust Speech Recognition
Author :
McAuley, James ; Ming, Ji ; Stewart, Darryl ; Hanna, Philip
Author_Institution :
Sch. of Comput. Sci., Queen´´s Univ. of Belfast, UK
Volume :
13
Issue :
5
fYear :
2005
Firstpage :
956
Lastpage :
964
Abstract :
This paper investigates the effect of modeling subband correlation for noisy speech recognition. Subband feature streams are assumed to be independent in many subband-based speech recognition systems. However, speech recognition experimental results suggest this assumption is unrealistic. In this paper, a method is proposed to incorporate correlation into subband speech feature streams. In the proposed method, all possible combinations of subbands are created and each combination is treated as a single frequency-band by calculating a single feature vector for it. The resulting feature vectors, therefore, capture information about every band in the combination, as well as the dependency across the bands. Although using the new features results in a higher computational complexity, our experimental results show that they effectively capture the correlation between the subbands while making minimal assumptions about the structure of the correlation. Experiments are conducted on the TIDigits database. The results demonstrate improved accuracy for clean speech recognition and improved robustness in the presence of both stationary and nonstationary band-selective noise, in comparison to a system assuming subband independence.
Keywords :
computational complexity; correlation methods; speech recognition; TIDigits database; computational complexity; noisy speech recognition; nonstationary band-selective noise; single feature vector; single frequency-band; subband correlation; Cepstral analysis; Computational complexity; Data mining; Degradation; Feature extraction; Frequency; Hidden Markov models; Noise robustness; Spatial databases; Speech recognition; Correlation; noise robustness; speech recognition; subband;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2005.851952
Filename :
1495477
Link To Document :
بازگشت