DocumentCode :
2491266
Title :
Combining speech energy and edge information for fast and efficient voice activity detection in noisy environments
Author :
Li, Xiaokun ; Deng, Yunbin
Author_Institution :
DCM Res. Resources, LLC, Germantown, MD, USA
fYear :
2008
fDate :
8-11 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
Robust voice activity detection (VAD) is a very crucial step and a challenging problem in developing real-time and high-performance speech recognition systems used in noisy environments. In this paper, we present a novel and efficient VAD algorithm for robust and real-time speech activity detection. The key idea of the algorithm is considering speech energy and edge information simultaneously when processing speech signals. A new finite state automaton is also developed for correctly detecting voice activities in noisy environments. Extensive and comparative experimental results show that the proposed VAD algorithm can greatly speed up speech recognition while reducing word error rate (WER) significantly. Compared with the state-of-the-art, the average improvement of using the proposed algorithm on noisy data is 46.5% for processing speed and 15.3% for WER.
Keywords :
error statistics; finite automata; speech recognition; edge information; finite state automaton; noisy environment; speech activity detection; speech energy; speech recognition; speech signal; voice activity detection; word error rate; Automata; Error analysis; Event detection; Information filtering; Information filters; Robustness; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
ISSN :
1051-4651
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2008.4761906
Filename :
4761906
Link To Document :
بازگشت