• DocumentCode
    2701816
  • Title

    Missing Feature Speech Recognition using Dereverberation and Echo Suppression in Reverberant Environments

  • Author

    Hyung-Min Park ; Stern, Richard M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    This paper describes an algorithm that efficiently segregates desired speech features from spatially-separated interfering sources in reverberant environments. Although most binaural segregation techniques successfully remove interference components in the absence of reverberation, source segregation in reverberant environments remains a challenging problem. In order to reduce the effects of reverberation, we present a method that dereverberates input signals before they are segregated. The dereverberation filter is estimated from the autocorrelation of the observations and primarily deals with early reflections, while late reflections are effectively suppressed by an inhibitory mechanism that estimates their relative contribution in each time-frequency segment. Information about the salience of the target in a given time-frequency segment based on source separation is combined with the corresponding information based on reverberation suppression through the use of a continually-variable weighting function or mask. Use of the novel reverberation processing results in a relative decrease in WER of 11.5% to 20.9% and use of the combined approaches reduces relative WER by as much as 65.3%.
  • Keywords
    echo suppression; filtering theory; reverberation; speech processing; speech recognition; autocorrelation; continually-variable weighting function; dereverberation filter; echo suppression; feature speech recognition; reverberant environments; source segregation; spatially-separated interfering sources; time-frequency segment; Autocorrelation; Automatic speech recognition; Degradation; Filter bank; Humans; Natural languages; Reverberation; Signal processing; Speech recognition; Time frequency analysis; Speech recognition; binaural processing; dereverberation; missing feature theory; spatial segregation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366929
  • Filename
    4218117