Title :
Two-Microphone Voice Activity Detection Based on the Homogeneity of the Direction of Arrival Estimates
Author :
Rubio, J.E. ; Ishizuka, K. ; Sawada, Hideyuki ; Araki, Shunsuke ; Nakatani, Takeshi ; Fujimoto, Mitoshi
Author_Institution :
NTT Commun. Sci. Lab., NTT Corp., Tokyo, Japan
Abstract :
Voice activity detection (VAD) systems have been the object of continuous research during the last three decades. While single microphone systems cannot take advantage of certain spatial properties of speech signals, microphone array systems consisting of many elements based on beamforming techniques can be difficult to implement in reality due to cost and complexity issues. The aim of the work described in this paper was to achieve both practical feasibility and spatial discrimination ability. A new approach is developed for two-microphone VAD capable of profiting from the concentration of speech energy in time, frequency and space. The algorithm is implemented and compared with several standard VAD algorithms, such as AFE, AMR and G.729B, and other recently proposed systems, revealing promising results under real-world noise conditions. The main advantage of the proposed approach is its capacity to outperform the above methods without the need for any spatial or spectral constraints, which makes it both versatile and capable of further improvement.
Keywords :
acoustic signal detection; acoustic signal processing; direction-of-arrival estimation; microphone arrays; speech processing; beamforming techniques; direction of arrival estimates; homogeneity; microphone array systems; spatial discrimination ability; spectral constraints; speech signals; two-microphone voice activity detection; Acoustic noise; Acoustic signal detection; Array signal processing; Direction of arrival estimation; Microphone arrays; Noise robustness; Object detection; Signal to noise ratio; Speech enhancement; Testing; Acoustic arrays; Acoustic signal detection; Direction of arrival estimation; Robustness; Speech processing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366930