DocumentCode :
2852500
Title :
Phone based voice activity detection using online Bayesian adaptation with conjugate normal distributions
Author :
Zhang, Jianping ; Ward, Wayne ; Pellom, Bryan
Author_Institution :
Center for Spoken Language Research, University of Colorado at Boulder, 80309-0594, USA
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
In this paper, we developed a highly efficient frame-level online adaptive voice activity detection (VAD) algorithm for the telephone-based CU Communicator spoken dialog system. The adaptive algorithm uses prior speaker and channel statistics as well as acoustic features of current sample frames to update model parameters. The algorithm achieved .05xRT in contrast to .7xRT of a compared VAD algorithm using 5-state HMMs. We detail the adaptive algorithm and address some real-time implementation issues. Experiments on live collected data show that there is a 23% error reduction compared with G.729B VAD.
Keywords :
Artificial neural networks; Bayesian methods; Gold;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743719
Filename :
5743719
Link To Document :
بازگشت