DocumentCode :
2150515
Title :
Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations
Author :
Weninger, Felix ; Schuller, Björn
Author_Institution :
Inst. for Human-Machine Commun., Tech. Univ. Munchen, Munich, Germany
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
337
Lastpage :
340
Abstract :
We present a study on purely data-based recognition of animal sounds, performing evaluation on a real-world database obtained from the Humboldt-University Animal Sound Archive. As we avoid a preselection of friendly cases, the challenge for the classifiers is to discriminate between species regardless of the age or stance of the animal. We define classification tasks that can be useful for information retrieval and indexing, facilitating categorization of large sound archives. On these tasks, we compare dynamic and static classification by left-right and cyclic Hidden Markov Models, recurrent neural networks with Long Short-Term Memory, and Support Vector Machines, as well as different features commonly found in sound classification and speech recognition, achieving up to 81.3 % accuracy on a 2-class, and 64.0 % on a 5-class task.
Keywords :
hidden Markov models; indexing; information retrieval; neural nets; speech recognition; support vector machines; Humboldt-University Animal Sound Archive; Long Short-Term Memory; Support Vector Machine; animal vocalization; audio recognition; categorization; cyclic Hidden Markov Model; dynamic classification; information indexing; information retrieval; real-world database; recurrent neural networks; speech recognition; static classification; Animals; Databases; Feature extraction; Hidden Markov models; Speech recognition; Support vector machines; Training; Audio Pattern Recognition; Bioacoustics; Sound Classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5946409
Filename :
5946409
Link To Document :
بازگشت