• DocumentCode
    495215
  • Title

    Combining Speech Enhancement and Discriminative Feature Extraction for Robust Speaker Recognition

  • Author

    Yan, Zhang ; Zhenmin, Tang ; Yanping, Li

  • Author_Institution
    Jinling Inst. of Technol., Nanjing, China
  • Volume
    5
  • fYear
    2009
  • fDate
    March 31 2009-April 2 2009
  • Firstpage
    274
  • Lastpage
    278
  • Abstract
    It is well known that discriminative feature and effective robust processing are two key techniques. This paper presents a new strategy which combining speech enhancement and discriminative feature in order to overcome the acoustics mismatch between training and testing data in the noise environment. On the one hand, a comparison results in two noise environments indicate that the recognition rates based on DFCC are averagely higher 6.11% (White noise) and 8%(Factory noise) respectively than MFCC, which confirmed that the effectiveness of discriminative and robustness of DFCC. On the other hand, when combining speech enhancement and discriminative feature, the improvement based on SMFCC is limited, only 0.93%, 1.87%, while the performance has been improved by 2.54%, 2.31% based on SDFCC.
  • Keywords
    acoustic signal processing; feature extraction; speech enhancement; speech recognition; DFCC; MFCC; acoustic signal processing; discriminative feature extraction; discriminative frequency cepstral coefficient; mel-frequency cepstral coefficient; robust speaker recognition; speech enhancement; Acoustics; Cepstral analysis; Feature extraction; Loudspeakers; Mel frequency cepstral coefficient; Noise robustness; Speaker recognition; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Engineering, 2009 WRI World Congress on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3507-4
  • Type

    conf

  • DOI
    10.1109/CSIE.2009.61
  • Filename
    5170540