• DocumentCode
    2150515
  • Title

    Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations

  • Author

    Weninger, Felix ; Schuller, Björn

  • Author_Institution
    Inst. for Human-Machine Commun., Tech. Univ. Munchen, Munich, Germany
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    337
  • Lastpage
    340
  • Abstract
    We present a study on purely data-based recognition of animal sounds, performing evaluation on a real-world database obtained from the Humboldt-University Animal Sound Archive. As we avoid a preselection of friendly cases, the challenge for the classifiers is to discriminate between species regardless of the age or stance of the animal. We define classification tasks that can be useful for information retrieval and indexing, facilitating categorization of large sound archives. On these tasks, we compare dynamic and static classification by left-right and cyclic Hidden Markov Models, recurrent neural networks with Long Short-Term Memory, and Support Vector Machines, as well as different features commonly found in sound classification and speech recognition, achieving up to 81.3 % accuracy on a 2-class, and 64.0 % on a 5-class task.
  • Keywords
    hidden Markov models; indexing; information retrieval; neural nets; speech recognition; support vector machines; Humboldt-University Animal Sound Archive; Long Short-Term Memory; Support Vector Machine; animal vocalization; audio recognition; categorization; cyclic Hidden Markov Model; dynamic classification; information indexing; information retrieval; real-world database; recurrent neural networks; speech recognition; static classification; Animals; Databases; Feature extraction; Hidden Markov models; Speech recognition; Support vector machines; Training; Audio Pattern Recognition; Bioacoustics; Sound Classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946409
  • Filename
    5946409