DocumentCode
2511940
Title
Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test
Author
Deng, Shiwen ; Han, Jiqing
fYear
2010
fDate
23-26 Aug. 2010
Firstpage
89
Lastpage
92
Abstract
The voice activity detection (VAD) algorithms by using Discrete Fourier Transform (DFT) coefficients are widely found in literature. However, some shortcomings for modeling a signal in the DFT can easily degrade the performance of a VAD in noise environment. To overcome the problem, this paper presents a novel approach by using the complex coefficients derived from complex exponential atomic decomposition of a signal. Those coefficients are modeled by a complex Gaussian probability distribution and a statistical model is employed to derive the decision rule from the likelihood ratio test. According to the experimental results, the proposed VAD method shows better performance than the VAD based on DFT coefficients in various noise environments.
Keywords
Gaussian distribution; discrete Fourier transforms; maximum likelihood estimation; speech processing; Gaussian probability distribution; decision rule; discrete Fourier transform; exponential atomic decomposition; likelihood ratio test; voice activity detection; Discrete Fourier transforms; Harmonic analysis; Matching pursuit algorithms; Noise measurement; Signal to noise ratio; Speech; Likelihood ratio test; Matching Pursuit; Voice activity detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location
Istanbul
ISSN
1051-4651
Print_ISBN
978-1-4244-7542-1
Type
conf
DOI
10.1109/ICPR.2010.30
Filename
5597635
Link To Document