Title :
A Bag-of-Features approach to acoustic event detection
Author :
Plinge, Axel ; Grzeszick, Rene ; Fink, Glenn A.
Author_Institution :
Dept. of Comput. Sci., Tech. Univ. Dortmund, Dortmund, Germany
Abstract :
The classification of acoustic events in indoor environments is an important task for many practical applications in smart environments. In this paper a novel approach for classifying acoustic events that is based on a Bag-of-Features approach is proposed. Mel and gammatone frequency cepstral coefficients that originate from psychoacoustic models are used as input features for the Bag-of representation. Rather than using a prior classification or segmentation step to eliminate silence and background noise, Bag-of-Features representations are learned for a background class. Supervised learning of codebooks and temporal coding are shown to improve the recognition rates. Three different databases are used for the experiments: the CLEAR sound event dataset, the D-CASE event dataset and a new set of smart room recordings.
Keywords :
acoustic signal detection; cepstral analysis; indoor environment; learning (artificial intelligence); signal classification; signal representation; CLEAR sound event dataset; D-CASE event dataset; Mel frequency cepstral coefficients; acoustic event classification; acoustic event detection; background noise; bag-of representation; bag-of-features approach; bag-of-features representations; gammatone frequency cepstral coefficients; indoor environments; psychoacoustic models; smart room recordings; supervised learning; temporal coding; Event detection; Mel frequency cepstral coefficient; Speech; Speech processing; Training; Vectors; Bag-of-Features; Event detection; sound classification;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854293