DocumentCode
788563
Title
Parametric Representations of Bird Sounds for Automatic Species Recognition
Author
Somervuo, Panu ; Härmä, Aki ; Fagerlund, Seppo
Author_Institution
Helsinki Univ.
Volume
14
Issue
6
fYear
2006
Firstpage
2252
Lastpage
2263
Abstract
This paper is related to the development of signal processing techniques for automatic recognition of bird species. Three different parametric representations are compared. The first representation is based on sinusoidal modeling which has been earlier found useful for highly tonal bird sounds. Mel-cepstrum parameters are used since they have been found very useful in the parallel problem of speech recognition. Finally, a vector of various descriptive features is tested because such models are popular in audio classification applications, and bird song is almost like music. We briefly introduce the methods and evaluate their performance in the classification and recognition of both individual syllables and song fragments of 14 common North-European Passerine bird species
Keywords
acoustic signal processing; bioacoustics; biocommunications; cepstral analysis; signal classification; signal representation; automatic species recognition; bird song; bird sounds; mel-cepstrum parameters; parametric representations; signal processing techniques; sinusoidal modeling; Acoustic signal processing; Birds; Books; Hidden Markov models; Pattern recognition; Sonogram; Spectrogram; Speech recognition; Testing; Vocabulary; Bird song; Gaussian mixture model (GMM); dynamic time warping (DTW); feature extraction; hidden Markov model (HMM); sinusoidal modeling;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2006.872624
Filename
1709912
Link To Document