DocumentCode :
2732334
Title :
Voice Activity Detection Using Entropy in Spectrum Domain
Author :
Asgari, Meysam ; Sayadian, Abolghasem ; Farhadloo, Mohsen ; Mehrizi, Elahe Abouie
Author_Institution :
Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
fYear :
2008
fDate :
7-10 Dec. 2008
Firstpage :
407
Lastpage :
410
Abstract :
In this paper we develop a voice activity detection algorithm based on entropy estimation of magnitude spectrum. In addition, the likelihood ratio test (LRT) is employed to determine a threshold to separate of speech segments from non-speech segments. The distributions of entropy magnitude of clean speech and noise signal are assumed to be Gaussian. The application of the concept of entropy to the speech detection problem is based on the assumption that the signal spectrum is more organized during speech segments than during noise segments. One of the main advantages of this method is that it is not very sensitive to the changes of noise level. Our simulation results show that the entropy based VAD is high performance in low signal to noise ratio (SNR) conditions (SNR < 0 dB).
Keywords :
Gaussian processes; entropy; maximum likelihood detection; speech recognition; Gaussian process; entropy estimation; likelihood ratio test; magnitude spectrum; noise signal; nonspeech segments; spectrum domain; speech signal; voice activity detection; Bandwidth; Bit rate; Entropy; Light rail systems; Linear predictive coding; Radio frequency; Signal to noise ratio; Speech coding; Speech enhancement; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Telecommunication Networks and Applications Conference, 2008. ATNAC 2008. Australasian
Conference_Location :
Adelaide, SA
Print_ISBN :
978-1-4244-2602-7
Electronic_ISBN :
978-1-4244-2603-4
Type :
conf
DOI :
10.1109/ATNAC.2008.4783359
Filename :
4783359
Link To Document :
بازگشت