Combining Speech Enhancement and Discriminative Feature Extraction for Robust Speaker Recognition

Author

Yan, Zhang ; Zhenmin, Tang ; Yanping, Li

Author_Institution

Jinling Inst. of Technol., Nanjing, China

Volume

5

fYear

2009

fDate

March 31 2009-April 2 2009

Firstpage

274

Lastpage

278

Abstract

It is well known that discriminative feature and effective robust processing are two key techniques. This paper presents a new strategy which combining speech enhancement and discriminative feature in order to overcome the acoustics mismatch between training and testing data in the noise environment. On the one hand, a comparison results in two noise environments indicate that the recognition rates based on DFCC are averagely higher 6.11% (White noise) and 8%(Factory noise) respectively than MFCC, which confirmed that the effectiveness of discriminative and robustness of DFCC. On the other hand, when combining speech enhancement and discriminative feature, the improvement based on SMFCC is limited, only 0.93%, 1.87%, while the performance has been improved by 2.54%, 2.31% based on SDFCC.

Keywords

acoustic signal processing; feature extraction; speech enhancement; speech recognition; DFCC; MFCC; acoustic signal processing; discriminative feature extraction; discriminative frequency cepstral coefficient; mel-frequency cepstral coefficient; robust speaker recognition; speech enhancement; Acoustics; Cepstral analysis; Feature extraction; Loudspeakers; Mel frequency cepstral coefficient; Noise robustness; Speaker recognition; Speech enhancement; Speech recognition; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Science and Information Engineering, 2009 WRI World Congress on

Conference_Location

Los Angeles, CA

Print_ISBN

978-0-7695-3507-4

Type

conf

DOI

10.1109/CSIE.2009.61

Filename

5170540