Title :
Modeling sub-band correlation for noise-robust speech recognition
Author :
McAuley, James ; Ming, Ji ; Hanna, Philip ; Stewart, Darryl
Author_Institution :
Sch. of Comput. Sci., Queen's Univ., Belfast, UK
Abstract :
The paper investigates the effect of modeling sub-band correlation for noisy speech recognition. Sub-band data streams are assumed to be independent in many sub-band based speech recognition systems. However, the structure and operation of the human vocal tract suggest that this assumption is unrealistic. A novel method is proposed to incorporate correlation into sub-band speech feature streams. All possible combinations of sub-bands are created, and each combination is treated as a single frequency band by calculating a single feature vector for it. The resulting feature vectors capture information about every band in the combination as well as the dependency across the bands. Experiments conducted on the TIDigits database demonstrate significantly improved robustness in comparison to an independent sub-band system in the presence of both stationary and non-stationary noise.
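The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each sub-band is given as a list of log filter-bank energies for one frame, that a combination is "treated as a single band" by pooling its channels, and that the feature vector is the first few coefficients of an unnormalized DCT-II (the keywords mention DCT and filter-bank cepstral analysis, but the exact procedure is not stated here). The names `dct_ii`, `subband_combinations`, `combination_features`, and the parameter `n_coeffs` are hypothetical.

```python
import math
from itertools import combinations


def dct_ii(x, n_coeffs):
    """Unnormalized DCT-II of x, keeping the first n_coeffs coefficients."""
    n = len(x)
    return [
        sum(x[i] * math.cos(math.pi * k * (i + 0.5) / n) for i in range(n))
        for k in range(n_coeffs)
    ]


def subband_combinations(n_bands):
    """All non-empty subsets of band indices, smallest subsets first."""
    return [
        combo
        for size in range(1, n_bands + 1)
        for combo in combinations(range(n_bands), size)
    ]


def combination_features(band_log_energies, n_coeffs=3):
    """Map each sub-band combination to a single feature vector.

    band_log_energies: one list of log filter-bank energies per sub-band,
    for a single frame (assumed input layout, not taken from the paper).
    """
    features = {}
    for combo in subband_combinations(len(band_log_energies)):
        # Assumption: pool the channels of all bands in the combination
        # and apply one DCT, so the vector reflects cross-band structure.
        pooled = [e for b in combo for e in band_log_energies[b]]
        features[combo] = dct_ii(pooled, min(n_coeffs, len(pooled)))
    return features


# Toy frame: 3 sub-bands with 2 filter-bank channels each (made-up values).
frame = [[1.0, 2.0], [0.5, 0.0], [3.0, 1.0]]
feats = combination_features(frame)
```

With N sub-bands this yields 2^N - 1 combinations, so the number of feature streams grows exponentially with N; the sketch enumerates them all, as the abstract describes, and makes no attempt to prune them.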
Keywords :
acoustic noise; cepstral analysis; channel bank filters; correlation methods; discrete cosine transforms; random noise; speech recognition; DCT; TIDigits database; discrete cosine transform; feature vector; filter-bank cepstral analysis; human vocal tract; noise-robust speech recognition; nonstationary noise; speech feature streams; stationary noise; sub-band correlation; Cepstral analysis; Computer science; Discrete cosine transforms; Feature extraction; Frequency; Human voice; Noise robustness; Spatial databases; Speaker recognition; Speech recognition;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326161