DocumentCode :
2970813
Title :
Extended Minimum Classification Error Training in Voice Activity Detection
Author :
Arakawa, Takayuki ; Al-Hassanieh, Haitham ; Tsujikawa, Masanori ; Isotani, Ryosuke
Author_Institution :
Common Platform Software Res. Labs., NEC Corp., Kawasaki, Japan
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
232
Lastpage :
236
Abstract :
Voice activity detection (VAD) is a fundamental part of speech processing. Combination of multiple acoustic features is an effective approach to make VAD more robust against various noise conditions. There have been proposed several feature combination methods, in which weights for feature values are optimized based on minimum classification error (MCE) training. We improve these MCE-based methods by introducing a novel discriminative function for whole frames. The proposed method optimizes combination weights taking into account the ratio between false acceptance and false rejection rates as well as the effect of the use of shaping procedures such as hangover.
Keywords :
pattern classification; signal detection; speech processing; speech recognition; MCE-based methods; extended minimum classification error training; false acceptance rate; false rejection rates; speech processing; speech recognition; voice activity detection; Acoustic signal detection; Communications technology; Computer errors; Laboratories; National electric code; Noise robustness; Optimization methods; Speech processing; Training data; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5373251
Filename :
5373251
Link To Document :
بازگشت