DocumentCode :
1932596
Title :
Combining MFCC and Pitch to Enhance the Performance of the Gender Recognition
Author :
Ting, Huang ; Yingchun, Yang ; Zhaohui, Wu
Author_Institution :
Dept. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou
Volume :
1
fYear :
2006
fDate :
16-20 2006
Abstract :
This paper describes a novel approach which combines the acoustic analysis using MFCC and the speaker´s mean pitch to improve the performance of the gender recognition. In acoustic analysis, two sets of Gaussian mixture model (GMM), male and female, are trained from the speech, and the most likely sequence of models with corresponding likelihood scores are produced. In pitch estimation approach, a threshold is specified to differentiate the two sets. The information provided by the acoustic analysis using MFCC and pitch estimation are combined by using a linear normalization fusion method. The system was tested on the SRMC databases giving at most 3.3% recognition error rate
Keywords :
Gaussian processes; speech recognition; GMM; Gaussian mixture model; MFCC; acoustic analysis; gender recognition; linear normalization fusion method; pitch estimation approach; Acoustic testing; Databases; Error analysis; Information analysis; Loudspeakers; Mel frequency cepstral coefficient; Performance analysis; Personal digital assistants; Speech analysis; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, 2006 8th International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9736-3
Electronic_ISBN :
0-7803-9736-3
Type :
conf
DOI :
10.1109/ICOSP.2006.345541
Filename :
4128956
Link To Document :
بازگشت