Title :
Robust Speech Feature Extraction by Growth Transformation in Reproducing Kernel Hilbert Space
Author :
Chakrabartty, Shantanu ; Deng, Yunbin ; Cauwenberghs, Gert
Author_Institution :
Michigan State Univ., East Lansing
Abstract :
The performance of speech recognition systems depends on consistent quality of the speech features across variable environmental conditions encountered during training and evaluation. This paper presents a kernel-based nonlinear predictive coding procedure that yields speech features which are robust to nonstationary noise contaminating the speech signal. Features maximally insensitive to additive noise are obtained by growth transformation of regression functions that span a reproducing kernel Hilbert space (RKHS). The features are normalized by construction and extract information pertaining to higher-order statistical correlations in the speech signal. Experiments with the TI-DIGIT database demonstrate consistent robustness to noise of varying statistics, yielding significant improvements in digit recognition accuracy over identical models trained using Mel-scale cepstral features and evaluated at noise levels between 0 and 30-dB signal-to-noise ratio.
Keywords :
Hilbert spaces; feature extraction; higher order statistics; regression analysis; speech coding; speech recognition; Mel-scale cepstral features; TI-DIGIT database; additive noise; growth transformation; higher-order statistical correlations; kernel-based nonlinear predictive coding; nonstationary noise; regression functions; reproducing kernel Hilbert space; signal-to-noise ratio; speech feature extraction; speech quality; speech recognition systems; Feature extraction; Hilbert space; Kernel; Noise level; Noise robustness; Signal to noise ratio; Speech analysis; Speech coding; Speech enhancement; Speech recognition; Feature extraction; growth transforms; noise robustness; nonlinear signal processing; reproducing kernel Hilbert Space; speaker verification;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.899285