Title :
Visual and acoustic identification of bird species
Author :
Marini, A. ; Turatti, A.J. ; Britto, A.S. ; Koerich, A.L.
Author_Institution :
Postgrad. Program in Inf., Pontifical Catholic Univ. of Parana, Curitiba, Brazil
Abstract :
This paper presents a novel approach for bird species identification that relies on both visual features extracted from unconstrained bird images and acoustic features extracted from bird vocalizations. The Scale Invariant Feature Transform (SIFT) detects local features in bird images, which are then used to train a support vector machine classifier. The instances that are not classified with a certain degree of certainty are then rejected and reclassified using Mel-frequency cepstral coefficients (MFCCs) extracted from the bird songs if available. Experiments conducted on a dataset of 50 bird species that comprise images from the CUB200-2011 and audio samples from Xeno-Canto have shown that improvements between 1.2 and 15.7 percentage points are achieved when using an acoustic classifier to re-process the instances rejected by the visual classifier, depending on the rejection level.
Keywords :
audio acoustics; scaling phenomena; support vector machines; CUB200-2011; MFCC; Mel-frequency cepstral coefficients; SIFT; Xeno-Canto; acoustic features; acoustic identification; audio samples; bird species; scale invariant feature transform; support vector machine classifier; visual classifier; visual identification; Birds; Feature extraction; Mel frequency cepstral coefficient; Monitoring; Support vector machines; Visualization; MFCC; SIFT; combination of classifiers; fine-grained classification; fusion of information;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178383