• DocumentCode
    3349055
  • Title

    A semi-continuous state transition probability HMM-based voice activity detection

  • Author

    Othman, H. ; Abounasr, T.

  • Author_Institution
    Sch. of Inf. Technol. & Eng., Univ. of Ottawa, Ont., Canada
  • Volume
    5
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    In this paper, we introduce an efficient hidden Markov model-based voice activity detection (VAD) algorithm with time-variant state transition probabilities in the underlying Markov chain. The transition probabilities vary in an exponential charge/discharge scheme and are softly merged with state conditional likelihood into a final VAD decision. Working in the domain of ITU-T G.729 parameters, with no additional cost for feature extraction, the proposed algorithm significantly outperforms G.729 Annex B VAD while providing a balanced tradeoff between clipping and false detection errors. The performance compares very favorably with adaptive multirate VAD, phase 2 (AMR2).
  • Keywords
    feature extraction; hidden Markov models; speech processing; HMM-based voice activity detection; ITU-T G.729; Markov chain; VAD decision; adaptive multirate VAD; clipping; exponential charge/discharge scheme; false detection errors; feature extraction; hidden Markov model; semi-continuous state transition probability; state conditional likelihood; time-variant state transition probabilities; Background noise; Communication standards; Costs; Echo cancellers; Feature extraction; Hidden Markov models; Information technology; Noise cancellation; Noise generators; Speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1327237
  • Filename
    1327237