DocumentCode :
1796947
Title :
Long-term auto-correlation statistics based voice activity detection for strong noisy speech
Author :
Wei Shi ; Yuexian Zou ; Yi Liu
Author_Institution :
Shenzhen Key Lab. of Intell. Media & Speech, PKU-HKUST Shenzhen-HongKong Instn., Shenzhen, China
fYear :
2014
fDate :
9-13 July 2014
Firstpage :
100
Lastpage :
104
Abstract :
This paper proposes a voice activity detection (VAD) algorithm based on a novel long-term metric. By assuming that the most significant difference between noisy speech and non-speech is the harmonicity of the noisy speech spectrum caused by human nature, the long-term auto-correlation statistics (LTACS) measure is designed to be shown as a powerful metric used in VAD. The LTACS measure is calculated among several successive frames around the concerned frame and it represents the significance of harmonics of the signal spectrum over a long term rather than a short term. A novel LTACS-based VAD algorithm is derived by jointly making use of the minimum operator to reduce non-speech variability and of then calculating variance to detect speech. Simulative comparisons with four standardized VAD algorithms (ETSI adaptive multi-rate option 1 and 2, ETSI advanced front-end and G.729 Annex B) as well as three former proposed VAD algorithms show that the proposed LTACS-based VAD algorithm achieves the best performance under all SNR conditions, especially in strong noisy environments (e.g., SNR is -5dB or -10dB).
Keywords :
noise abatement; speech recognition; ETSI adaptive multirate option; harmonicity; long-term auto-correlation statistics measure; noisy speech spectrum; signal spectrum; strong noisy speech; voice activity detection algorithm; Abstracts; Gold; Indexes; Mobile communication; Noise measurement; Reliability; Speech; long-term auto-correlation statistics; strong noisy speech; voice activity detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-5401-8
Type :
conf
DOI :
10.1109/ChinaSIP.2014.6889210
Filename :
6889210
Link To Document :
بازگشت