• DocumentCode
    2770916
  • Title

    Audio Classification of Bird Species: A Statistical Manifold Approach

  • Author

    Briggs, Forrest ; Raich, Raviv ; Fern, Xiaoli Z.

  • Author_Institution
    Sch. of EECS, Oregon State Univ., Corvallis, OR, USA
  • fYear
    2009
  • fDate
    6-9 Dec. 2009
  • Firstpage
    51
  • Lastpage
    60
  • Abstract
    Our goal is to automatically identify which species of bird is present in an audio recording using supervised learning. Devising effective algorithms for bird species classification is a preliminary step toward extracting useful ecological data from recordings collected in the field. We propose a probabilistic model for audio features within a short interval of time, then derive its Bayes risk-minimizing classifier, and show that it is closely approximated by a nearest-neighbor classifier using Kullback-Leibler divergence to compare histograms of features. We note that feature histograms can be viewed as points on a statistical manifold, and KL divergence approximates geodesic distances defined by the Fisher information metric on such manifolds. Motivated by this fact, we propose the use of another approximation to the Fisher information metric, namely the Hellinger metric. The proposed classifiers achieve over 90% accuracy on a data set containing six species of bird, and outperform support vector machines.
  • Keywords
    audio recording; audio signal processing; learning (artificial intelligence); signal classification; statistical analysis; support vector machines; telecommunication computing; Bayes risk-minimizing classifier; Fisher information metric; Hellinger metric; Kullback-Leibler divergence; audio classification; audio recording; bird species; nearest-neighbor classifier; probabilistic model; statistical manifold approach; supervised learning; support vector machines; Automobiles; Birds; Clustering algorithms; Costs; Data mining; Humans; Partitioning algorithms; Personnel; Training data; Vocabulary; audio; bayes; classification; clustering; codebook; geodesic; manifold; map; maximum a-posteriori; mfccs; nearest neighbor;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-5242-2
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2009.65
  • Filename
    5360230