• DocumentCode
    3697424
  • Title

    Exciting estimated clean spectra for speech resynthesis

  • Author

    Sreyas Srimath Tirumala;Michael I Mandel

  • Author_Institution
    The Ohio State University, Computer Science &
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Spectral masking techniques are prevalent for noise suppression but they damage speech in regions of the spectrum where both noise and speech are present. This paper instead utilizes a recently introduced analysis-by-synthesis technique to estimate the spectral envelope of the speech at all frequencies, and adds to it a model of the speech excitation necessary to fully resynthesize a clean speech signal. Such a resynthesis should have little noise and high quality compared to mask-based approaches. We compare several different excitation signals on the Aurora4 corpus, including those derived from the high quefrency components of the noisy mixture and from the combination of a noise robust pitch tracker and a voiced/unvoiced classifier. Preliminary subjective evaluations suggest that the speech synthesized using our approach has higher voice quality and noise suppression than spectral masking.
  • Keywords
    "Speech","Noise measurement","Mel frequency cepstral coefficient","Noise robustness","Deconvolution","Speech recognition","Speech processing"
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015 IEEE Workshop on
  • Type

    conf

  • DOI
    10.1109/WASPAA.2015.7336907
  • Filename
    7336907