DocumentCode
1652119
Title
Blind estimation of reverberation time based on spectro-temporal modulation filtering
Author
Feifei Xiong ; Goetze, Stefan ; Meyer, Bernd T.
Author_Institution
Project Group Hearing-, Speech- & Audio-Technol. (HSA), Fraunhofer Inst. for Digital Media Technol. (IDMT), Oldenburg, Germany
fYear
2013
Firstpage
443
Lastpage
447
Abstract
A novel method for blind estimation of the reverberation time (RT60) is proposed based on applying spectro-temporal modulation filters to time-frequency representations. 2D-Gabor filters arranged in a filterbank enable an analysis of the properties of temporal, spectral, and spectro-temporal filtering for this task. Features are used as input to a multi-layer perceptron (MLP) classifier combined with a simple decision rule that attributes a specific RT60 to a given utterance and allows to assess the reliability of the approach for different resolutions of RT60 classification. While the filter set including temporal, spectral, and spectro-temporal filters already outperforms an MFCC baseline, the error rates are further reduced when relying on diagonal spectro-temporal filters alone. The average error rate is 1.9% for the best feature set, which corresponds to a relative reduction of 58.3% compared to the MFCC baseline for RT60s in 0.1 s resolution.
Keywords
Gabor filters; blind source separation; filtering theory; multilayer perceptrons; reverberation; signal classification; time-frequency analysis; 2D-Gabor filters; MFCC baseline; MLP classifier; RT60 classification; blind estimation; decision rule; diagonal spectro-temporal filters; error rates; filterbank; multilayer perceptron classifier; relative reduction; reliability; reverberation time; spectral filtering; spectro-temporal filtering; spectro-temporal modulation filtering; spectro-temporal modulation filters; time-frequency representations; Estimation; Frequency modulation; Mel frequency cepstral coefficient; Reverberation; Speech; 2D Gabor filterbank; Blind reverberation time estimation; spectro-temporal modulation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6637686
Filename
6637686
Link To Document