DocumentCode :
3334687
Title :
A voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin
Author :
Yinfeng Wang ; Shaoguang Huang ; Ying Wei
Author_Institution :
Sch. of Inf. Sci. & Eng., Shandong Univ., Jinan, China
Volume :
03
fYear :
2013
fDate :
16-18 Dec. 2013
Firstpage :
1287
Lastpage :
1291
Abstract :
Voice activity detection algorithms are widely used in the areas of voice compression, speech synthesis, speech recognition, speech enhancement, and etc. In this paper, an efficient voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin is proposed. The proposed sub-band detection consists of two parts: crosswise detection and lengthwise detection. Energy detection and pitch detection are in the range of considerations. For a better performance, double-threshold criterion is used to reduce the misjudgment rate of the detection. Performance evaluation is based on six noise environments with different SNRs. Experiment results indicate that the proposed algorithm can detect the area of voice effectively in non-stationary environment and low SNR environment and has the potential to progress.
Keywords :
speech enhancement; speech recognition; speech synthesis; time-frequency analysis; Mandarin; crosswise detection; double-threshold criterion; energy detection; lengthwise detection; pitch detection; speech enhancement; speech recognition; speech synthesis; time-frequency characteristics; voice activity detection algorithm; voice compression; Accuracy; Acoustics; Filter banks; Signal to noise ratio; Speech; Time-frequency analysis; VAD; mandarin; pitch; sub-band detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image and Signal Processing (CISP), 2013 6th International Congress on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2763-0
Type :
conf
DOI :
10.1109/CISP.2013.6743871
Filename :
6743871
Link To Document :
بازگشت