Title :
Multichannel voice activity detection with spherically invariant sparse distributions
Author :
Lee, Bowon ; Kalker, Ton
Author_Institution :
Hewlett-Packard Labs., Palo Alto, CA, USA
Abstract :
We propose a statistical multichannel voice activity detection algorithm by modeling the frequency components of speech signals as sparse multivariate complex distributions. In particular, we formulate a likelihood ratio test by modeling a multichannel speech observation as a spherically invariant random process with a parameter governing its sparseness. In addition, we consider reverberation as a component of the statistical model. Experimental results show that our proposed method significantly reduces false-alarm rate for reverberation tails and that sparse distributions provide higher detection accuracy compared to the traditional Gaussian distribution.
Keywords :
random processes; signal detection; sparse matrices; speech processing; statistical distributions; Gaussian distribution; false-alarm rate; likelihood ratio test; multichannel speech observation modeling; reverberation; speech signal frequency component modeling; spherically invariant random process; spherically invariant sparse multivariate complex distribution; statistical multichannel voice activity detection algorithm; Conferences; Discrete Fourier transforms; Frequency; Gaussian distribution; Light rail systems; Microphone arrays; Reverberation; Signal processing algorithms; Speech processing; Testing; Likelihood ratio test; Maximum-likelihood estimation; Microphone array; Multivariate complex distribution; Voice activity detection;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location :
New Paltz, NY
Print_ISBN :
978-1-4244-3678-1
Electronic_ISBN :
1931-1168
DOI :
10.1109/ASPAA.2009.5346523