• DocumentCode
    2052475
  • Title

    Feature enhancement error compensation for noise robust speech recognition

  • Author

    Gil Ho Lee ; Shin Jae Kang ; Chang Woo Han ; Nam Soo Kim

  • Author_Institution
    DMC R&D enter, Acoust. & Sound Technol. Lab., Suwon, South Korea
  • fYear
    2012
  • fDate
    20-23 March 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents an approach to feature enhancement error compensation for noise robust speech recognition. The conventional feature enhancement techniques estimate the enhanced clean speech from the noise corrupted speech for improving speech recognition performance under noisy environments. During speech feature enhancement process, undesired residual error is generated because of incomplete property of the noise reduction. We apply the switching linear dynamic transducer (SLDT) to compensate this residual error. The SLDT describes the sequence-to-sequence mapping in a systematic way and has been applied to stereo data based speech feature mapping for channel distorted speech recognition. We assume that feature enhancement is a channel. The proposed method shows recognition error reduction in Aurora 2 digit task and Aurora 4 large vocabulary task with the interacting multiple model.
  • Keywords
    error compensation; feature extraction; signal denoising; speech enhancement; speech recognition; transducers; Aurora 2 digit task; Aurora 4 large vocabulary task; SLDT; channel distorted speech recognition; enhanced clean speech estimation; feature enhancement error compensation; noise corrupted speech; noise reduction; noise robust speech recognition; noisy environments; recognition error reduction; residual error; sequence-to-sequence mapping; speech recognition performance improvement; stereo data based speech feature mapping; switching linear dynamic transducer; Adaptation models; Noise; Noise measurement; Speech; Speech enhancement; Speech recognition; Vectors; Switching linear dynamic transducer; error estimation; feature enhancement; noise robust speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Signals and Devices (SSD), 2012 9th International Multi-Conference on
  • Conference_Location
    Chemnitz
  • Print_ISBN
    978-1-4673-1590-6
  • Electronic_ISBN
    978-1-4673-1589-0
  • Type

    conf

  • DOI
    10.1109/SSD.2012.6197929
  • Filename
    6197929