DocumentCode
2852500
Title
Phone based voice activity detection using online Bayesian adaptation with conjugate normal distributions
Author
Zhang, Jianping ; Ward, Wayne ; Pellom, Bryan
Author_Institution
Center for Spoken Language Research, University of Colorado at Boulder, 80309-0594, USA
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
In this paper, we developed a highly efficient frame-level online adaptive voice activity detection (VAD) algorithm for the telephone-based CU Communicator spoken dialog system. The adaptive algorithm uses prior speaker and channel statistics as well as acoustic features of current sample frames to update model parameters. The algorithm achieved .05xRT in contrast to .7xRT of a compared VAD algorithm using 5-state HMMs. We detail the adaptive algorithm and address some real-time implementation issues. Experiments on live collected data show that there is a 23% error reduction compared with G.729B VAD.
Keywords
Artificial neural networks; Bayesian methods; Gold;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743719
Filename
5743719
Link To Document