Title :
Enhancement of throat microphone recordings by learning phone-dependent mappings of speech spectra
Author :
Turan, M. A. Tugtekin ; Erzin, E.
Author_Institution :
Multimedia, Vision & Graphics Lab., Koc Univ., Istanbul, Turkey
Abstract :
We investigate spectral envelope mapping problem with joint analysis of throat- and acoustic-microphone recordings to enhance throat-microphone speech. A new phone-dependent GMM-based spectral envelope mapping scheme, which performs the minimum mean square error (MMSE) estimation of the acoustic-microphone spectral envelope, has been proposed. Experimental evaluations are performed to compare the proposed mapping scheme to the state-of-theart GMM-based estimator using both objective and subjective evaluations. Objective evaluations are performed with the log-spectral distortion (LSD) and the wideband perceptual evaluation of speech quality (PESQ) metrics. Subjective evaluations are performed with the A/B pair comparison listening test. Both objective and subjective evaluations yield that the proposed phone-dependent mapping consistently improves performances over the state-of-the-art GMM estimator.
Keywords :
Gaussian processes; learning (artificial intelligence); least mean squares methods; speech enhancement; A-B pair comparison listening test; LSD; MMSE estimation; PESQ metrics; acoustic-microphone recordings; learning phone-dependent mappings; log-spectral distortion; minimum mean square error estimation; objective evaluations; phone-dependent GMM; spectral envelope mapping problem; speech enhancement; speech quality metrics; speech spectra; subjective evaluations; throat microphone recordings; wideband perceptual evaluation; Acoustics; Microphones; Robustness; Speech; Speech enhancement; Speech recognition; spectral envelope estimation; speech enhancement; throat-microphone;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639029