• DocumentCode
    2732334
  • Title

    Voice Activity Detection Using Entropy in Spectrum Domain

  • Author

    Asgari, Meysam ; Sayadian, Abolghasem ; Farhadloo, Mohsen ; Mehrizi, Elahe Abouie

  • Author_Institution
    Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
  • fYear
    2008
  • fDate
    7-10 Dec. 2008
  • Firstpage
    407
  • Lastpage
    410
  • Abstract
    In this paper we develop a voice activity detection algorithm based on entropy estimation of magnitude spectrum. In addition, the likelihood ratio test (LRT) is employed to determine a threshold to separate of speech segments from non-speech segments. The distributions of entropy magnitude of clean speech and noise signal are assumed to be Gaussian. The application of the concept of entropy to the speech detection problem is based on the assumption that the signal spectrum is more organized during speech segments than during noise segments. One of the main advantages of this method is that it is not very sensitive to the changes of noise level. Our simulation results show that the entropy based VAD is high performance in low signal to noise ratio (SNR) conditions (SNR < 0 dB).
  • Keywords
    Gaussian processes; entropy; maximum likelihood detection; speech recognition; Gaussian process; entropy estimation; likelihood ratio test; magnitude spectrum; noise signal; nonspeech segments; spectrum domain; speech signal; voice activity detection; Bandwidth; Bit rate; Entropy; Light rail systems; Linear predictive coding; Radio frequency; Signal to noise ratio; Speech coding; Speech enhancement; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunication Networks and Applications Conference, 2008. ATNAC 2008. Australasian
  • Conference_Location
    Adelaide, SA
  • Print_ISBN
    978-1-4244-2602-7
  • Electronic_ISBN
    978-1-4244-2603-4
  • Type

    conf

  • DOI
    10.1109/ATNAC.2008.4783359
  • Filename
    4783359