DocumentCode
231537
Title
Voice activity detection based on ensemble empirical mode decomposition and teager kurtosis
Author
Chong Feng ; Chunhui Zhao
Author_Institution
Coll. of Inf. & Commun. Eng., Harbin Eng. Univ., Harbin, China
fYear
2014
fDate
19-23 Oct. 2014
Firstpage
455
Lastpage
460
Abstract
This paper proposes an improved voice activity detection (VAD) methodology based on ensemble empirical mode decomposition (EEMD) algorithm and the teager kurtosis to avoid the defect of empirical mode decomposition (EMD) in mode mixing. The teager energy operator is used to track the modulation energy of each intrinsic mode function (IMF), decomposed by ensemble empirical mode decomposition. The root power function and order statistics filter are used on the teager kurtosis for feature extraction. Voice activity detection can be implemented over the suitable threshold which can be automatically estimated by tracking the minimum of the extracted feature values. Experiments show that the proposed VAD can achieve comparable results at high signal-to-noise ratio (SNR). For low SNR conditions, it is able to maintain lower error detection ratio and higher detection ratio, compared with those of the original algorithm.
Keywords
error detection; feature extraction; filtering theory; speech processing; EEMD; IMF; SNR; VAD; VOICE ACTIVITY DETECTION; ensemble empirical mode decomposition; error detection ratio; feature extraction; intrinsic mode function; root power function; signal-to-noise ratio; statistic filter; teager kurtosis; Empirical mode decomposition; Feature extraction; Noise measurement; Signal to noise ratio; Speech; White noise; ensemble empirical mode decomposition; root power function; teager kurtosis; voice activity detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing (ICSP), 2014 12th International Conference on
Conference_Location
Hangzhou
ISSN
2164-5221
Print_ISBN
978-1-4799-2188-1
Type
conf
DOI
10.1109/ICOSP.2014.7015047
Filename
7015047
Link To Document