• DocumentCode
    2816074
  • Title

    The modulation spectrogram: in pursuit of an invariant representation of speech

  • Author

    Greenberg, Steven ; Kingsbury, Brian E D

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • Volume
    3
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1647
  • Abstract
    Understanding the human ability to reliably process and decode speech across a wide range of acoustic conditions and speaker characteristics is a fundamental challenge for current theories of speech perception. Conventional speech representations such as the sound spectrogram emphasize many spectro-temporal details that are not directly germane to the linguistic information encoded in the speech signal and which consequently do not display the perceptual stability characteristic of human listeners. We propose a new representational format, the modulation spectrogram, that discards much of the spectro-temporal detail in the speech signal and instead focuses on the underlying, stable structure incorporated in the low-frequency portion of the modulation spectrum distributed across critical-band-like channels. We describe the representation and illustrate its stability with color-mapped displays and with results from automatic speech recognition experiments
  • Keywords
    modulation; signal representation; spectral analysis; speech processing; stability; acoustic conditions; automatic speech recognition experiments; color-mapped displays; critical band like channels; human listeners; invariant speech representation; linguistic information; modulation spectrogram; perceptual stability characteristic; sound spectrogram; speaker characteristics; speech decoding; speech perception; speech representation format; speech signal; speech understanding; Automatic speech recognition; Decoding; Finite impulse response filter; Frequency; Humans; Loudspeakers; Low pass filters; Spectrogram; Speech coding; Stability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.598826
  • Filename
    598826