DocumentCode :
1758718
Title :
Voice Activity Detection Via Noise Reducing Using Non-Negative Sparse Coding
Author :
Teng, Peng ; Jia, Yunde
Author_Institution :
Beijing Lab of Intelligent Information Technology, and the School of Computer Science, Beijing Institute of Technology, Beijing, China
Volume :
20
Issue :
5
fYear :
2013
fDate :
41395
Firstpage :
475
Lastpage :
478
Abstract :
This letter presents a voice activity detection (VAD) approach using non-negative sparse coding to improve the detection performance in low signal-to-noise ratio (SNR) conditions. The basic idea is to use features extracted from a noise-reduced representation of original audio signals. We decompose the magnitude spectrum of an audio signal on a speech dictionary learned from clean speech and a noise dictionary learned from noise samples. Only coefficients corresponding to the speech dictionary are considered and used as the noise-reduced representation of the signal for feature extraction. A conditional random field (CRF) is used to model the correlation between feature sequences and voice activity labels along audio signals. Then, we assign the voice activity labels for a given audio by decoding the CRF. Experimental results demonstrate that our VAD approach has a good performance in low SNR conditions.
Keywords :
Dictionaries; Encoding; Feature extraction; Signal to noise ratio; Speech; Vectors; Conditional random fields; noise reducing; non-negative sparse coding; voice activity detection;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2013.2252615
Filename :
6479683
Link To Document :
بازگشت