  • DocumentCode
    110569
  • Title
    Demodulation of Narrowband Speech Spectrograms Using the Riesz Transform
  • Author
    Aragonda, Haricharan ; Seelamantula, Chandra Sekhar
  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
  • Volume
    23
  • Issue
    11
  • fYear
    2015
  • fDate
    Nov. 2015
  • Firstpage
    1824
  • Lastpage
    1834
  • Abstract
    We propose a two-dimensional (2-D) multicomponent amplitude-modulation, frequency-modulation (AM-FM) model for a spectrogram patch corresponding to voiced speech, and develop a new demodulation algorithm to effectively separate the AM, which is related to the vocal tract response, and the carrier, which is related to the excitation. The demodulation algorithm is based on the Riesz transform and is developed along the lines of Hilbert-transform-based demodulation for 1-D AM-FM signals. We compare the performance of the Riesz transform technique with that of the sinusoidal demodulation technique on real speech data. Experimental results show that the Riesz-transform-based demodulation technique represents spectrogram patches accurately. The spectrograms reconstructed from the demodulated AM and carrier are inverted and the corresponding speech signal is synthesized. The signal-to-noise ratio (SNR) of the reconstructed speech signal, with respect to clean speech, was found to be 2 to 4 dB higher with the Riesz transform technique than with the sinusoidal demodulation technique.
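    The abstract describes separating a 2-D patch into an AM and a carrier by means of the Riesz transform, by analogy with Hilbert-transform demodulation in 1-D. The following is a minimal illustrative sketch of that general idea, not the authors' algorithm: it assumes a single-component, zero-mean patch, applies the standard Riesz frequency responses through the FFT (which implies periodic boundaries), and takes the magnitude of the resulting three-component signal as the AM estimate. The function name `riesz_demodulate`, the `eps` parameter, and the synthetic test patch are all hypothetical choices made for this example.

```python
import numpy as np


def riesz_demodulate(patch, eps=1e-12):
    """Split a zero-mean 2-D patch into an AM estimate and a carrier estimate.

    Assumption: the patch behaves locally like a(x, y) * cos(phi(x, y)).
    """
    patch = np.asarray(patch, dtype=float)
    rows, cols = patch.shape

    # Normalized frequency grids (cycles per sample) along each axis.
    v = np.fft.fftfreq(rows)[:, None]   # vertical frequency (axis 0)
    u = np.fft.fftfreq(cols)[None, :]   # horizontal frequency (axis 1)
    radius = np.sqrt(u**2 + v**2)
    radius[0, 0] = 1.0  # avoid 0/0 at DC; both numerators are zero there

    spectrum = np.fft.fft2(patch)

    # Riesz transform pair with frequency responses -j*u/|w| and -j*v/|w|.
    r1 = np.real(np.fft.ifft2(-1j * u / radius * spectrum))
    r2 = np.real(np.fft.ifft2(-1j * v / radius * spectrum))

    # The magnitude of (patch, r1, r2) plays the role that the
    # analytic-signal envelope plays in 1-D Hilbert demodulation.
    am = np.sqrt(patch**2 + r1**2 + r2**2)
    carrier = patch / np.maximum(am, eps)
    return am, carrier


# Synthetic check: constant amplitude 2 on a plane-wave carrier that
# completes a whole number of cycles over the 64 x 64 grid.
n = np.arange(64)
x, y = np.meshgrid(n, n)
phase = 2 * np.pi * (8 * x + 6 * y) / 64
patch = 2.0 * np.cos(phase)

am, carrier = riesz_demodulate(patch)
print(np.allclose(am, 2.0), np.allclose(carrier, np.cos(phase)))
```

    For a real spectrogram patch, the mean would need to be removed first and the multicomponent structure handled separately; neither step is shown here.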
  • Keywords
    Hilbert transforms; amplitude modulation; demodulation; frequency modulation; signal reconstruction; signal synthesis; speech synthesis; 1D AM-FM signal; 2D multicomponent amplitude modulation; Hilbert transform-based demodulation; Riesz transform; frequency modulation model; narrowband speech spectrogram demodulation; signal-to-noise ratio; sinusoidal demodulation technique; spectrogram patch reconstruction; speech signal reconstruction; speech signal synthesis; vocal tract response; voiced speech; Demodulation; Narrowband; Spectrogram; Speech; Speech processing; Time-frequency analysis; Transforms; Amplitude modulation model of spectrograms; Riesz transform; grating compression transform (GCT); multiband AM-FM; sinusoidal demodulation; spectro-temporal analysis;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    2329-9290
  • Type
    jour
  • DOI
    10.1109/TASLP.2015.2449088
  • Filename
    7131474