Title :
Missing-feature approaches in speech recognition
Author :
Raj, Bhiksha ; Stern, Richard M.
Abstract :
In this article we review a wide variety of techniques, based on the identification of missing spectral features, that have proved effective in reducing the error rates of automatic speech recognition systems. These approaches have been conspicuously effective in ameliorating the effects of transient maskers such as impulsive noise or background music. We describe two broad classes of missing-feature algorithms: feature-vector imputation algorithms (which restore unreliable components of incoming feature vectors) and classifier-modification algorithms (which dynamically reconfigure the classifier itself to cope with the effects of unreliable feature components). We review the mathematics of four major missing-feature techniques: the feature-imputation techniques of cluster-based reconstruction and covariance-based reconstruction, and the classifier-modification methods of class-conditional imputation and marginalization. We also discuss how the common feature extraction procedures of cepstral analysis, temporal-difference features, and mean subtraction can be handled by speech recognition systems that use missing-feature techniques. We conclude with a small number of selected experimental results. These results confirm the effectiveness of all the missing-feature approaches discussed in ameliorating the effects of both stationary and transient noise, as well as the particular effectiveness of both soft masks and fragment decoding.
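As a rough illustration of the marginalization idea named in the abstract, the sketch below scores a feature vector against each class using only the components flagged as reliable. It is not the article's formulation: the diagonal-covariance Gaussian class models, the binary reliability mask, and the names `marginal_log_likelihood` and `classify` are all assumptions made for this example. For a diagonal Gaussian, integrating out an unreliable component is equivalent to dropping its term from the log-likelihood, which is what the code does.

```python
import math

def marginal_log_likelihood(x, mask, mean, var):
    """Log-likelihood of x under a diagonal Gaussian (assumed model),
    summed over the components marked reliable in mask."""
    total = 0.0
    for xi, reliable, mu, v in zip(x, mask, mean, var):
        if reliable:  # unreliable components are marginalized out (skipped)
            total += -0.5 * (math.log(2 * math.pi * v) + (xi - mu) ** 2 / v)
    return total

def classify(x, mask, classes):
    """Return the class name with the highest marginal log-likelihood.
    classes maps a name to a (mean, var) pair; hypothetical layout."""
    return max(
        classes,
        key=lambda name: marginal_log_likelihood(x, mask, *classes[name]),
    )

# Toy data: the second component is corrupted by a transient masker.
classes = {
    "a": ([0.0, 0.0, 0.0], [1.0, 1.0, 1.0]),
    "b": ([3.0, 3.0, 3.0], [1.0, 1.0, 1.0]),
}
x = [0.2, 9.0, -0.1]
mask = [True, False, True]
print(classify(x, mask, classes))
print(classify(x, [True, True, True], classes))
```

With the corrupted component masked, the decision rests on the two reliable components alone; with every component treated as reliable, the outlier dominates the score. A soft mask would replace the boolean flags with reliability probabilities, which this sketch does not attempt.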
Keywords :
cepstral analysis; feature extraction; speech recognition; transients; automatic speech recognition systems; background music; class-conditional imputation/marginalization; classifier-modification algorithms; cluster-based reconstruction; covariance-based reconstruction; feature-vector imputation algorithms; fragment decoding; impulsive noise; mean subtraction; missing spectral features; missing-feature approach; soft masks; stationary noise; temporal-difference features; transient noise; Automatic speech recognition; Background noise; Clustering algorithms; Decoding; Error analysis; Heuristic algorithms; Mathematics
Journal_Title :
Signal Processing Magazine, IEEE
DOI :
10.1109/MSP.2005.1511828