DocumentCode :
2871644
Title :
Computational auditory scene recognition
Author :
Peltonen, Vesa ; Tuomi, Juha ; Klapuri, Anssi ; Huopaniemi, Jyri ; Sorsa, Timo
Author_Institution :
Tampere University of Technology, Signal Processing Laboratory, P.O.Box 553, FIN-33101, Finland
Volume :
2
fYear :
2002
fDate :
13-17 May 2002
Abstract :
In this paper, we address the problem of computational auditory scene recognition and describe methods to classify auditory scenes into predefined classes. By auditory scene recognition we mean recognition of an environment using audio information only. The auditory scenes comprised tens of everyday outside and inside environments, such as streets, restaurants, offices, family homes, and cars. Two completely different but almost equally effective classification systems were used: band-energy ratio features with 1-NN classifier and Mel-frequency cepstral coefficients with Gaussian mixture models. The best obtained recognition rate for 17 different scenes out of 26 and for an analysis duration of 30 seconds was 68.4%. For comparison, the recognition accuracy of humans was 70% for 25 different scenes and the average response time was around 20 seconds. The efficiency of different acoustic features and the effect of test sequence length were studied.
Keywords :
Artificial neural networks; Libraries; Mel frequency cepstral coefficient; Rail transportation; Roads; Vehicles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5745009
Filename :
5745009
Link To Document :
بازگشت