Title :
Speech enhancement using generalized maximum a posteriori spectral amplitude estimator
Author :
Yu-Cheng Su ; Yu Tsao ; Jung-En Wu ; Fu-Rong Jean
Author_Institution :
Res. Center for Inf. Technol. Innovation, Acad. Sinica, Taipei, Taiwan
Abstract :
This paper proposes a generalized maximum a posteriori spectral amplitude (GMAPA) algorithm to spectral restoration for speech enhancement. The proposed GMAPA algorithm dynamically adjusts the scale of prior information to calculate the gain function for spectral restoration. In higher signal-to-noise ratio (SNR) conditions, GMAPA adopts a smaller scale to prevent overcompensations that may result in speech distortions. On the other hand, in lower SNR conditions, GMAPA uses a larger scale to enable the gain function to more effectively remove noise components from noisy speech. We also develop a mapping function to optimally determine the prior information scale according to the SNR of speech utterances. Two standardized speech databases, Aurora-4 and Aurora-2, are used to conduct objective and recognition evaluations, respectively, to test the proposed GMAPA algorithm. For comparison, three conventional spectral restoration algorithms are also evaluated; they are minimum mean-square error spectral estimator (MMSE), maximum likelihood spectral amplitude estimator (MLSA), and maximum a posteriori spectral amplitude estimator (MAPA). The experimental results first confirm that GMAPA provides better objective evaluation scores than MMSE, MLSA, and MAPA in lower SNR conditions, with comparable scores to MLSA in higher SNR conditions. Moreover, our recognition results indicate that GMAPA outperforms the three conventional algorithms consistently over different testing conditions.
Keywords :
audio databases; distortion; maximum likelihood estimation; mean square error methods; signal denoising; signal restoration; speech enhancement; Aurora-2 speech database; Aurora-4 speech database; GMAPA; MLSA; MMSE; gain function; generalized maximum a posteriori spectral amplitude estimator; mapping function; maximum likelihood spectral amplitude estimator; minimum mean-square error spectral estimator; noisy speech; spectral restoration; speech distortions; speech enhancement; speech utterances; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Speech recognition; Generalized MAPA; MAPA; MLSA; MMSE; Speech enhancement; spectral restoration;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639114