• DocumentCode
    2511940
  • Title

    Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test

  • Author

    Deng, Shiwen ; Han, Jiqing

  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    89
  • Lastpage
    92
  • Abstract
    The voice activity detection (VAD) algorithms by using Discrete Fourier Transform (DFT) coefficients are widely found in literature. However, some shortcomings for modeling a signal in the DFT can easily degrade the performance of a VAD in noise environment. To overcome the problem, this paper presents a novel approach by using the complex coefficients derived from complex exponential atomic decomposition of a signal. Those coefficients are modeled by a complex Gaussian probability distribution and a statistical model is employed to derive the decision rule from the likelihood ratio test. According to the experimental results, the proposed VAD method shows better performance than the VAD based on DFT coefficients in various noise environments.
  • Keywords
    Gaussian distribution; discrete Fourier transforms; maximum likelihood estimation; speech processing; Gaussian probability distribution; decision rule; discrete Fourier transform; exponential atomic decomposition; likelihood ratio test; voice activity detection; Discrete Fourier transforms; Harmonic analysis; Matching pursuit algorithms; Noise measurement; Signal to noise ratio; Speech; Likelihood ratio test; Matching Pursuit; Voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.30
  • Filename
    5597635