• DocumentCode
    2852500
  • Title

    Phone based voice activity detection using online Bayesian adaptation with conjugate normal distributions

  • Author

    Zhang, Jianping ; Ward, Wayne ; Pellom, Bryan

  • Author_Institution
    Center for Spoken Language Research, University of Colorado at Boulder, 80309-0594, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    In this paper, we developed a highly efficient frame-level online adaptive voice activity detection (VAD) algorithm for the telephone-based CU Communicator spoken dialog system. The adaptive algorithm uses prior speaker and channel statistics as well as acoustic features of current sample frames to update model parameters. The algorithm achieved .05xRT in contrast to .7xRT of a compared VAD algorithm using 5-state HMMs. We detail the adaptive algorithm and address some real-time implementation issues. Experiments on live collected data show that there is a 23% error reduction compared with G.729B VAD.
  • Keywords
    Artificial neural networks; Bayesian methods; Gold;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743719
  • Filename
    5743719