DocumentCode :
1184281
Title :
RASTA processing of speech
Author :
Hermansky, Hynek ; Morgan, Nelson
Author_Institution :
Oregon Graduate Inst., Portland, OR, USA
Volume :
2
Issue :
4
fYear :
1994
fDate :
10/1/1994 12:00:00 AM
Firstpage :
578
Lastpage :
589
Abstract :
Performance of even the best current stochastic recognizers severely degrades in an unexpected communications environment. In some cases, the environmental effect can be modeled by a set of simple transformations and, in particular, by convolution with an environmental impulse response and the addition of some environmental noise. Often, the temporal properties of these environmental effects are quite different from the temporal properties of speech. We have been experimenting with filtering approaches that attempt to exploit these differences to produce robust representations for speech recognition and enhancement and have called this class of representations relative spectra (RASTA). In this paper, we review the theoretical and experimental foundations of the method, discuss the relationship with human auditory perception, and extend the original method to combinations of additive noise and convolutional noise. We discuss the relationship between RASTA features and the nature of the recognition models that are required and the relationship of these features to delta features and to cepstral mean subtraction. Finally, we show an application of the RASTA technique to speech enhancement
Keywords :
filtering and prediction theory; hearing; speech analysis and processing; speech recognition; transient response; RASTA processing; cepstral mean subtraction; communications environment; convolutional noise; delta features; environmental impulse response; environmental noise; filtering approaches; human auditory perception; recognition models; relative spectra; robust representations; speech enhancement; speech processing; stochastic recognizers; temporal properties; Additive noise; Convolution; Degradation; Filtering; Humans; Noise robustness; Speech processing; Speech recognition; Stochastic resonance; Working environment noise;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.326616
Filename :
326616
Link To Document :
بازگشت