Title :
An effective missing feature compensation method for speech recognition at noisy environment
Author :
Xu-Yan Hu ; Yue-Xian Zou ; Wei Shi
Author_Institution :
ELIP/ADSPLAB, Peking Univ., Shenzhen, China
Abstract :
It is a challenge task for maintaining high correct word accuracy rate (WAR) for state-of-art automatic speech recognition (ASR) systems when the SNR goes very low. To deal with such situation, the missing feature technology (MFT) has shown as one of the mainstream algorithms. In principle, conventional MFT firstly separate the unreliable spectral bins from the reliable ones. Then the unreliable bins are reconstructed by missing feature algorithm [7]. When SNR goes low, the performance of the conventional MFT for ASR system is limited since both the reliable and unreliable spectral bins will be corrupted by the noise components. In this paper, a novel missing feature compensation method was developed by considering compensating both unreliable and reliable spectral bins. With the assumption of GMM distribution of the clean speech spectral vector, a dual MFT (DMFT) algorithm is developed, where the reliable spectral bins corrupted by noise have been compensated by removing the noise components. Several experiments have been carried out to evaluate the performance of the proposed DMFT algorithm by using AURORA2 database. From the results, it is clear to see that the proposed DMFT algorithm improves the WAR under all types of noises at different SNR levels compared with the traditional MFT algorithm.
Keywords :
feature extraction; signal denoising; speech recognition; vectors; AURORA2 database; DMFT algorithm; GMM distribution; SNR levels; automatic speech recognition systems; clean speech spectral vector; dual MFT algorithm; missing feature compensation method; missing feature technology; noisy environment; reliable spectral bins; spectral bins; word accuracy rate; Hidden Markov models; Noise measurement; Reliability; Signal to noise ratio; Speech; Vectors; GMM; feature compensation; missing feature technology; noisy environment; speech recognition;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-5401-8
DOI :
10.1109/ChinaSIP.2014.6889217