Title of article :
ROBUST SPEAKER GENDER IDENTIFICATION USING EMPIRICAL MODE DECOMPOSITION-BASED CEPSTRAL FEATURES
Author/Authors :
alipoor, ghasem hamedan university of technology - electrical engineering department, ايران , samadi, ehsan hamedan university of technology - electrical engineering department, ايران
From page :
71
To page :
81
Abstract :
Automatic speaker gender identification is a field of research with numerous practical applications. However, this issue has not gained its deserved attention, in particular in the presence of environmental noises. In this paper, using the empirical mode decomposition (EMD), some new and improved mel-frequency cepstral coefficient (MFCC) features are developed to address the problem of robust speaker gender identification. In the proposed approach, EMD is employed as a filter bank to decompose the speech signal into its frequency bands. Furthermore, another variant is also developed in which the complete ensemble EMD (CEEMD) supersedes the EMD. Moreover, support vector machine (SVM) with radial basis function (RBF) kernel is employed for classification. Performance of these methods is examined for gender identification, in noise-free environments as well as in the presence of various Gaussian and non-Gaussian noises. Simulation results show that, although with fewer features used, utilizing the improved EMD-based cepstral features in noiseless situations leads to the same accuracy as that of the original MFCCs. However, in noisy environments the proposed methods outperform the conventional way of extracting the MFCCs.
Keywords :
Automatic Gender Identification , Empirical Mode Decomposition , Mel , Frequency Cepstral Coefficients , Support Vector Machine
Journal title :
Asia-Pacific Journal Of Information Technology an‎d Multimedia
Journal title :
Asia-Pacific Journal Of Information Technology an‎d Multimedia
Record number :
2699070
Link To Document :
بازگشت