Title :
HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis
Author :
Park, Ji Hun ; Yoon, Jae Sam ; Kim, Hong Kook
Author_Institution :
Dept. of Inf. & Commun., Gwangju Inst. of Sci. & Technol. (GIST), Gwangju
Abstract :
In this paper, we propose a new mask estimation method for the computational auditory scene analysis (CASA) of speech using two microphones. The proposed method is based on a hidden Markov model (HMM) in order to incorporate an observation that the mask information should be correlated over contiguous analysis frames. In other words, HMM is used to estimate the mask information represented as the interaural time difference (ITD) and the interaural level difference (ILD) of two channel signals, and the estimated mask information is finally employed in the separation of desired speech from noisy speech. To show the effectiveness of the proposed mask estimation, we then compare the performance of the proposed method with that of a Gaussian kernel-based estimation method in terms of the performance of speech recognition. As a result, the proposed HMM-based mask estimation method provided an average word error rate reduction of 69.14% when compared with the Gaussian kernel-based mask estimation method.
Keywords :
estimation theory; hearing; hidden Markov models; microphones; signal representation; speech recognition; HMM-based mask estimation; computational auditory scene analysis; contiguous analysis frame; hidden Markov model; interaural level difference; interaural time difference; mask information representation; microphone; speech recognition front-end; Acoustic noise; Auditory system; Hidden Markov models; Image analysis; Information analysis; Microphones; Pattern analysis; Speech coding; Speech recognition; Working environment noise; Computational auditory scene analysis; Hidden Markov model; Mask estimation; Noise robustness; Speech recognition;
Conference_Titel :
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
Conference_Location :
Trento
Print_ISBN :
978-1-4244-2337-8
Electronic_ISBN :
978-1-4244-2338-5
DOI :
10.1109/HSCMA.2008.4538715