Title :
Sparse power spectrum based robust voice activity detector
Author :
You, Datao ; Han, Jiqing ; Zheng, Guibin ; Zheng, Tieran
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
This paper presents a robust approach to improve the performance of voice activity detector (VAD) in low signal-to-noise ratio (SNR) noisy environments. To this end, we first generate sparse representations by Bregman Iteration based sparse decomposition with a learned over-complete dictionary, and derive a kind of audio feature called sparse power spectrum from the sparse representations. we then propose a method to calculate the short segment average spectrum and long segment average spectrum from sparse power spectrum. Finally, we design a criterion to detect speech region and non-speech region based on the above average spectrum. Experiments show that the proposed approach further improves the performance of VAD in low SNR noisy environments.
Keywords :
audio signal processing; Bregman iteration based sparse decomposition; audio feature; low SNR noisy environment; robust voice activity detector; segment average spectrum; signal to noise ratio noisy environment; sparse power spectrum; sparse representation; speech region detection; Dictionaries; Noise measurement; Robustness; Signal to noise ratio; Speech; Vectors; Sparse decomposition; average energy; sparse spectrum; voice activity detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6287874