DocumentCode :
667522
Title :
On the use of spectro-temporal features for the IEEE AASP challenge ‘detection and classification of acoustic scenes and events’
Author :
Schroder, Jochen ; Moritz, Niko ; Schadler, Marc Rene ; Cauchi, Benjamin ; Adiloglu, Kamil ; Anemuller, Jorn ; Doclo, Simon ; Kollmeier, Birger ; Goetze, Stefan
Author_Institution :
Project Group Hearing, Speech & Audio Technol., Fraunhofer IDMT, Oldenburg, Germany
fYear :
2013
fDate :
20-23 Oct. 2013
Firstpage :
1
Lastpage :
4
Abstract :
In this contribution, an acoustic event detection system based on spectro-temporal features and a two-layer hidden Markov model as back-end is proposed within the framework of the IEEE AASP challenge `Detection and Classification of Acoustic Scenes and Events´ (D-CASE). Noise reduction based on the log-spectral amplitude estimator by [1] and noise power density estimation by [2] is used for signal enhancement. Performance based on three different kinds of features is compared, i.e. for amplitude modulation spectrogram, Gabor filterbank-features and conventional Mel-frequency cepstral coefficients (MFCCs), all of them known from automatic speech recognition (ASR). The evaluation is based on the office live recordings provided within the D-CASE challenge. The influence of the signal enhancement is investigated and the increase in recognition rate by the proposed features in comparison to MFCC-features is shown. It is demonstrated that the proposed spectro-temporal features achieve a better recognition accuracy than MFCCs.
Keywords :
Gabor filters; hidden Markov models; signal classification; speech enhancement; speech recognition; D-CASE challenge; Gabor filterbank-features; IEEE AASP challenge; Mel-frequency cepstral coefficients; acoustic event detection system; automatic speech recognition; detection and classification of acoustic scenes and events; log-spectral amplitude estimator; noise power density estimation; noise reduction; office live recordings; signal enhancement; spectro-temporal features; two-layer hidden Markov model; Acoustics; Event detection; Feature extraction; Frequency modulation; Hidden Markov models; Noise; Speech; Gabor filterbank; IEEE AASP D-CASE challenge; acoustic event detection; amplitude modulation spectrogram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Type :
conf
DOI :
10.1109/WASPAA.2013.6701868
Filename :
6701868
Link To Document :
بازگشت