مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

1184281

Title :

RASTA processing of speech

Author :

Hermansky, Hynek ; Morgan, Nelson

Author_Institution :

Oregon Graduate Inst., Portland, OR, USA

Volume :

Issue :

fYear :

1994

fDate :

10/1/1994 12:00:00 AM

Firstpage :

578

Lastpage :

589

Abstract :

Performance of even the best current stochastic recognizers severely degrades in an unexpected communications environment. In some cases, the environmental effect can be modeled by a set of simple transformations and, in particular, by convolution with an environmental impulse response and the addition of some environmental noise. Often, the temporal properties of these environmental effects are quite different from the temporal properties of speech. We have been experimenting with filtering approaches that attempt to exploit these differences to produce robust representations for speech recognition and enhancement and have called this class of representations relative spectra (RASTA). In this paper, we review the theoretical and experimental foundations of the method, discuss the relationship with human auditory perception, and extend the original method to combinations of additive noise and convolutional noise. We discuss the relationship between RASTA features and the nature of the recognition models that are required and the relationship of these features to delta features and to cepstral mean subtraction. Finally, we show an application of the RASTA technique to speech enhancement

Keywords :

filtering and prediction theory; hearing; speech analysis and processing; speech recognition; transient response; RASTA processing; cepstral mean subtraction; communications environment; convolutional noise; delta features; environmental impulse response; environmental noise; filtering approaches; human auditory perception; recognition models; relative spectra; robust representations; speech enhancement; speech processing; stochastic recognizers; temporal properties; Additive noise; Convolution; Degradation; Filtering; Humans; Noise robustness; Speech processing; Speech recognition; Stochastic resonance; Working environment noise;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.326616

Filename :

326616

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1184281