DocumentCode
2732334
Title
Voice Activity Detection Using Entropy in Spectrum Domain
Author
Asgari, Meysam ; Sayadian, Abolghasem ; Farhadloo, Mohsen ; Mehrizi, Elahe Abouie
Author_Institution
Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
fYear
2008
fDate
7-10 Dec. 2008
Firstpage
407
Lastpage
410
Abstract
In this paper we develop a voice activity detection algorithm based on entropy estimation of magnitude spectrum. In addition, the likelihood ratio test (LRT) is employed to determine a threshold to separate of speech segments from non-speech segments. The distributions of entropy magnitude of clean speech and noise signal are assumed to be Gaussian. The application of the concept of entropy to the speech detection problem is based on the assumption that the signal spectrum is more organized during speech segments than during noise segments. One of the main advantages of this method is that it is not very sensitive to the changes of noise level. Our simulation results show that the entropy based VAD is high performance in low signal to noise ratio (SNR) conditions (SNR < 0 dB).
Keywords
Gaussian processes; entropy; maximum likelihood detection; speech recognition; Gaussian process; entropy estimation; likelihood ratio test; magnitude spectrum; noise signal; nonspeech segments; spectrum domain; speech signal; voice activity detection; Bandwidth; Bit rate; Entropy; Light rail systems; Linear predictive coding; Radio frequency; Signal to noise ratio; Speech coding; Speech enhancement; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunication Networks and Applications Conference, 2008. ATNAC 2008. Australasian
Conference_Location
Adelaide, SA
Print_ISBN
978-1-4244-2602-7
Electronic_ISBN
978-1-4244-2603-4
Type
conf
DOI
10.1109/ATNAC.2008.4783359
Filename
4783359
Link To Document