Title :
A voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin
Author :
Yinfeng Wang ; Shaoguang Huang ; Ying Wei
Author_Institution :
Sch. of Inf. Sci. & Eng., Shandong Univ., Jinan, China
Abstract :
Voice activity detection algorithms are widely used in the areas of voice compression, speech synthesis, speech recognition, speech enhancement, and etc. In this paper, an efficient voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin is proposed. The proposed sub-band detection consists of two parts: crosswise detection and lengthwise detection. Energy detection and pitch detection are in the range of considerations. For a better performance, double-threshold criterion is used to reduce the misjudgment rate of the detection. Performance evaluation is based on six noise environments with different SNRs. Experiment results indicate that the proposed algorithm can detect the area of voice effectively in non-stationary environment and low SNR environment and has the potential to progress.
Keywords :
speech enhancement; speech recognition; speech synthesis; time-frequency analysis; Mandarin; crosswise detection; double-threshold criterion; energy detection; lengthwise detection; pitch detection; speech enhancement; speech recognition; speech synthesis; time-frequency characteristics; voice activity detection algorithm; voice compression; Accuracy; Acoustics; Filter banks; Signal to noise ratio; Speech; Time-frequency analysis; VAD; mandarin; pitch; sub-band detection;
Conference_Titel :
Image and Signal Processing (CISP), 2013 6th International Congress on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2763-0
DOI :
10.1109/CISP.2013.6743871