• DocumentCode
    758356
  • Title

    Generalized LRT-Based Voice Activity Detector

  • Author

    Górriz, Juan Manuel ; Ramírez, Javier ; Puntonet, Carlos G. ; Segura, José Carlos

  • Author_Institution
    Departmento Teoria de la Senal, Telematica y Comunicaciones
  • Volume
    13
  • Issue
    10
  • fYear
    2006
  • Firstpage
    636
  • Lastpage
    639
  • Abstract
    A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on well-known statistical tests based on the determination of the speech/non-speech bispectra by means of third-order auto-cumulants. This algorithm differs from many others in the way the decision rule is formulated being the statistical tests built on a multiple observation (MO) window consisting of averaged bispectrum coefficients of the speech signal. Clear improvements in speech/non-speech discrimination accuracy demonstrate the effectiveness of the proposed VAD. It is shown that application of a statistical detection test leads to a better separation of the speech and noise distributions, thus allowing a more effective discrimination and a tradeoff between complexity and performance. The experimental analysis carried out on the AURORA 3 databases provides an extensive performance evaluation together with an exhaustive comparison to the standard VADs, such as ITU G.729, GSM AMR, and ETSI AFE, for distributed speech recognition (DSR) and other recently reported VADs
  • Keywords
    signal detection; speech processing; speech recognition; statistical distributions; statistical testing; AURORA 3 database; DSR; VAD algorithm; averaged bispectrum coefficient; distributed speech recognition; generalized LRT; likelihood ratio test; multiple observation window; noise distribution; statistical detection test; voice activity detector; Detectors; Distributed databases; GSM; Performance analysis; Robustness; Speech analysis; Speech enhancement; Speech recognition; Testing; Working environment noise; Bispectra analysis; higher order statistics; noise reduction; speech/non-speech detection;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2006.876340
  • Filename
    1703546