DocumentCode :
3415225
Title :
Bird-phrase segmentation and verification: A noise-robust template-based approach
Author :
Kaewtip, Kantapon ; Lee Ngee Tan ; Taylor, Charles E. ; Alwan, Abeer
Author_Institution :
Dept. of Electr. Eng., Univ. of California, Los Angeles, Los Angeles, CA, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
758
Lastpage :
762
Abstract :
In this paper, we present a birdsong-phrase segmentation and verification algorithm that is robust to limited training data, class variability, and noise. The algorithm comprises a noise-robust, Dynamic-Time-Warping (DTW)-based segmentation and a discriminative classifier for outlier rejection. The algorithm utilizes DTW and prominent (high energy) time-frequency regions of training spectrograms to derive a reliable noise-robust template for each phrase class. The resulting template is then used for segmenting continuous recordings to obtain segment candidates whose spectrogram amplitudes in the prominent regions are used as features to a Support Vector Machine (SVM). The algorithm is evaluated on the Cassin´s Vireo recordings; our proposed system yields low Equal Error Rates (EER) and segment boundaries that are close to those obtained from manual annotations and, is better than energy or entropy-based birdsong segmentation algorithms. In the presence of additive noise (-10 to 10 dB SNR), the proposed phrase detection system does not degrade as significantly as the other algorithms do.
Keywords :
acoustic noise; support vector machines; Cassin´s Vireo recordings; DTW-based segmentation; SVM; additive noise; birdsong-phrase segmentation; class variability; discriminative classifier; dynamic-time-warping-based segmentation; entropy-based birdsong segmentation algorithms; limited training data; low equal error rates; manual annotations; noise-robust template-based approach; outlier rejection; phrase detection system; reliable noise-robust template; segment boundaries; segmenting continuous recording; spectrogram amplitudes; support vector machine; time-frequency region; training spectrograms; verification algorithm; Birds; Classification algorithms; Noise; Noise measurement; Spectrogram; Support vector machines; Training; SVM; bird phrase detection; dynamic time-warping; limited data; noise-robust; template-based;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178071
Filename :
7178071
Link To Document :
بازگشت