• DocumentCode
    1652119
  • Title
    Blind estimation of reverberation time based on spectro-temporal modulation filtering
  • Author
    Xiong, Feifei ; Goetze, Stefan ; Meyer, Bernd T.
  • Author_Institution
    Project Group Hearing, Speech and Audio Technology (HSA), Fraunhofer Institute for Digital Media Technology (IDMT), Oldenburg, Germany
  • fYear
    2013
  • Firstpage
    443
  • Lastpage
    447
  • Abstract
    A novel method for blind estimation of the reverberation time (RT60) is proposed, based on applying spectro-temporal modulation filters to time-frequency representations. 2D Gabor filters arranged in a filterbank enable an analysis of the properties of temporal, spectral, and spectro-temporal filtering for this task. The filter outputs are used as input features to a multi-layer perceptron (MLP) classifier, combined with a simple decision rule that attributes a specific RT60 to a given utterance and makes it possible to assess the reliability of the approach for different resolutions of RT60 classification. While the filter set including temporal, spectral, and spectro-temporal filters already outperforms an MFCC baseline, the error rates are further reduced when relying on diagonal spectro-temporal filters alone. The average error rate is 1.9% for the best feature set, which corresponds to a relative reduction of 58.3% compared to the MFCC baseline when RT60 is classified at a resolution of 0.1 s.
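    The abstract reports the best error rate and the relative reduction, but not the MFCC baseline error rate itself. As a rough check, the baseline can be back-computed, assuming the relative reduction is defined as (baseline - best) / baseline. The function name below is hypothetical, and because both reported figures are rounded, the result is only approximate.

```python
def implied_baseline_error(best_error_pct, relative_reduction_pct):
    """Back-compute a baseline error rate (in percent).

    Assumes relative reduction = (baseline - best) / baseline, which is an
    assumption about how the abstract's figure was derived.
    """
    if not 0 <= relative_reduction_pct < 100:
        raise ValueError("relative reduction must be in [0, 100)")
    return best_error_pct / (1.0 - relative_reduction_pct / 100.0)


# Figures taken from the abstract: 1.9% error, 58.3% relative reduction.
baseline = implied_baseline_error(1.9, 58.3)
print(f"implied MFCC baseline error: {baseline:.2f}%")  # prints 4.56%
```

    Under that assumption, the MFCC baseline would have an average error rate of roughly 4.6%.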
  • Keywords
    Gabor filters; blind source separation; filtering theory; multilayer perceptrons; reverberation; signal classification; time-frequency analysis; 2D-Gabor filters; MFCC baseline; MLP classifier; RT60 classification; blind estimation; decision rule; diagonal spectro-temporal filters; error rates; filterbank; multilayer perceptron classifier; relative reduction; reliability; reverberation time; spectral filtering; spectro-temporal filtering; spectro-temporal modulation filtering; spectro-temporal modulation filters; time-frequency representations; Estimation; Frequency modulation; Mel frequency cepstral coefficient; Reverberation; Speech; 2D Gabor filterbank; Blind reverberation time estimation; spectro-temporal modulation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2013.6637686
  • Filename
    6637686