DocumentCode
2816074
Title
The modulation spectrogram: in pursuit of an invariant representation of speech
Author
Greenberg, Steven ; Kingsbury, Brian E D
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
Volume
3
fYear
1997
fDate
21-24 Apr 1997
Firstpage
1647
Abstract
Understanding the human ability to reliably process and decode speech across a wide range of acoustic conditions and speaker characteristics is a fundamental challenge for current theories of speech perception. Conventional speech representations such as the sound spectrogram emphasize many spectro-temporal details that are not directly germane to the linguistic information encoded in the speech signal and which consequently do not display the perceptual stability characteristic of human listeners. We propose a new representational format, the modulation spectrogram, that discards much of the spectro-temporal detail in the speech signal and instead focuses on the underlying, stable structure incorporated in the low-frequency portion of the modulation spectrum distributed across critical-band-like channels. We describe the representation and illustrate its stability with color-mapped displays and with results from automatic speech recognition experiments
Keywords
modulation; signal representation; spectral analysis; speech processing; stability; acoustic conditions; automatic speech recognition experiments; color-mapped displays; critical band like channels; human listeners; invariant speech representation; linguistic information; modulation spectrogram; perceptual stability characteristic; sound spectrogram; speaker characteristics; speech decoding; speech perception; speech representation format; speech signal; speech understanding; Automatic speech recognition; Decoding; Finite impulse response filter; Frequency; Humans; Loudspeakers; Low pass filters; Spectrogram; Speech coding; Stability;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.598826
Filename
598826
Link To Document