Title :
Combining Speech Enhancement and Discriminative Feature Extraction for Robust Speaker Recognition
Author :
Yan, Zhang ; Zhenmin, Tang ; Yanping, Li
Author_Institution :
Jinling Inst. of Technol., Nanjing, China
fDate :
March 31 2009-April 2 2009
Abstract :
It is well known that discriminative feature and effective robust processing are two key techniques. This paper presents a new strategy which combining speech enhancement and discriminative feature in order to overcome the acoustics mismatch between training and testing data in the noise environment. On the one hand, a comparison results in two noise environments indicate that the recognition rates based on DFCC are averagely higher 6.11% (White noise) and 8%(Factory noise) respectively than MFCC, which confirmed that the effectiveness of discriminative and robustness of DFCC. On the other hand, when combining speech enhancement and discriminative feature, the improvement based on SMFCC is limited, only 0.93%, 1.87%, while the performance has been improved by 2.54%, 2.31% based on SDFCC.
Keywords :
acoustic signal processing; feature extraction; speech enhancement; speech recognition; DFCC; MFCC; acoustic signal processing; discriminative feature extraction; discriminative frequency cepstral coefficient; mel-frequency cepstral coefficient; robust speaker recognition; speech enhancement; Acoustics; Cepstral analysis; Feature extraction; Loudspeakers; Mel frequency cepstral coefficient; Noise robustness; Speaker recognition; Speech enhancement; Speech recognition; Working environment noise;
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
DOI :
10.1109/CSIE.2009.61