Title :
Voice Activity Detection Algorithm Based on Mel-scale Frequency Log-Spectral Energy Difference In Noise Environment
Author :
Gang, Niu ; Kai, Wang ; Xizhi, Feng ; Naishu, Chen
Author_Institution :
Ordnance Tech. Inst. of Ordnance Eng. Coll., Shijiazhuang, China
Abstract :
Voice Activity Detection (VAD) is an important part of speech signal processing; its accuracy directly influences the speed and results of subsequent processing. Most VAD methods are developed under laboratory conditions and require stationary noise and a high signal-to-noise ratio (SNR). In practice, these conditions usually cannot be satisfied. This paper proposes a VAD algorithm based on the “Mel-scale Frequency Log-Spectral Energy Difference”, a distance measure that is easy to compute and has a clear physical meaning. Compared with traditional methods, the algorithm uses a relative measure in place of an absolute one, so it can accurately locate speech in noise at low SNR and in slowly varying, non-stationary noise, while remaining robust.
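The abstract does not give the exact formulation, so the following is only a minimal sketch of one way a Mel-scale log-spectral energy difference detector could be built, not the authors' implementation. It assumes triangular Mel filters, a noise estimate taken from the first few frames, a mean absolute difference of log band energies as the distance, and a slow noise update during non-speech frames. All function names, the frame length, the filter count, the threshold, and the smoothing factor are hypothetical choices made for illustration. Because a difference of logarithms equals the logarithm of a ratio, the distance depends on energy relative to the noise estimate rather than on absolute level.

```python
import numpy as np


def hz_to_mel(f):
    # Common Mel-scale mapping (an assumption; the abstract names no formula).
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters with centres equally spaced on the Mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        lo, mid, hi = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def log_mel_energies(signal, sample_rate, frame_len=256, hop=128, n_filters=20):
    # Returns an array of shape (n_frames, n_filters) of log band energies.
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hamming(frame_len)
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    out = np.empty((n_frames, n_filters))
    for t in range(n_frames):
        frame = signal[t * hop:t * hop + frame_len] * window
        power = np.abs(np.fft.rfft(frame)) ** 2
        out[t] = np.log(bank @ power + 1e-10)  # small floor avoids log(0)
    return out


def detect_speech(signal, sample_rate, noise_frames=10, threshold=1.0,
                  alpha=0.95):
    # Hypothetical decision rule: flag a frame as speech when its mean
    # absolute log-energy difference from the noise estimate exceeds
    # `threshold`. The first `noise_frames` frames are assumed noise-only.
    log_e = log_mel_energies(signal, sample_rate)
    noise = log_e[:noise_frames].mean(axis=0)
    flags = np.zeros(len(log_e), dtype=bool)
    for t, frame in enumerate(log_e):
        distance = np.mean(np.abs(frame - noise))
        flags[t] = distance > threshold
        if not flags[t]:
            # Track slowly varying noise only while no speech is detected.
            noise = alpha * noise + (1.0 - alpha) * frame
    return flags
```

Calling `detect_speech(x, 8000)` on a one-dimensional NumPy array `x` sampled at 8 kHz returns one boolean per frame. The threshold and the noise update rate would need tuning for real recordings.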
Keywords :
signal denoising; speech processing; Mel-scale frequency log-spectral energy difference; distance-measuring degree; laboratory-scale environment; noise environment; nonstationary noise; signal noise ratio; speech signal processing; voice activity detection algorithm; Detection algorithms; Frequency; Optical noise; Optical signal processing; Signal processing; Signal processing algorithms; Signal to noise ratio; Speech enhancement; Speech processing; Working environment noise; Log-Spectral Energy Difference; Low SNR; Mel-Scale; VAD
Conference_Title :
Information and Computing (ICIC), 2010 Third International Conference on
Conference_Location :
Wuxi, Jiangsu
Print_ISBN :
978-1-4244-7081-5
Electronic_ISBN :
978-1-4244-7082-2
DOI :
10.1109/ICIC.2010.324