DocumentCode
3697424
Title
Exciting estimated clean spectra for speech resynthesis
Author
Sreyas Srimath Tirumala;Michael I Mandel
Author_Institution
The Ohio State University, Computer Science &
fYear
2015
Firstpage
1
Lastpage
5
Abstract
Spectral masking techniques are prevalent for noise suppression but they damage speech in regions of the spectrum where both noise and speech are present. This paper instead utilizes a recently introduced analysis-by-synthesis technique to estimate the spectral envelope of the speech at all frequencies, and adds to it a model of the speech excitation necessary to fully resynthesize a clean speech signal. Such a resynthesis should have little noise and high quality compared to mask-based approaches. We compare several different excitation signals on the Aurora4 corpus, including those derived from the high quefrency components of the noisy mixture and from the combination of a noise robust pitch tracker and a voiced/unvoiced classifier. Preliminary subjective evaluations suggest that the speech synthesized using our approach has higher voice quality and noise suppression than spectral masking.
Keywords
"Speech","Noise measurement","Mel frequency cepstral coefficient","Noise robustness","Deconvolution","Speech recognition","Speech processing"
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015 IEEE Workshop on
Type
conf
DOI
10.1109/WASPAA.2015.7336907
Filename
7336907
Link To Document