• DocumentCode
    3415225
  • Title

    Bird-phrase segmentation and verification: A noise-robust template-based approach

  • Author

    Kaewtip, Kantapon ; Lee Ngee Tan ; Taylor, Charles E. ; Alwan, Abeer

  • Author_Institution
    Dept. of Electr. Eng., Univ. of California, Los Angeles, Los Angeles, CA, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    758
  • Lastpage
    762
  • Abstract
    In this paper, we present a birdsong-phrase segmentation and verification algorithm that is robust to limited training data, class variability, and noise. The algorithm comprises a noise-robust, Dynamic-Time-Warping (DTW)-based segmentation and a discriminative classifier for outlier rejection. The algorithm utilizes DTW and prominent (high energy) time-frequency regions of training spectrograms to derive a reliable noise-robust template for each phrase class. The resulting template is then used for segmenting continuous recordings to obtain segment candidates whose spectrogram amplitudes in the prominent regions are used as features to a Support Vector Machine (SVM). The algorithm is evaluated on the Cassin´s Vireo recordings; our proposed system yields low Equal Error Rates (EER) and segment boundaries that are close to those obtained from manual annotations and, is better than energy or entropy-based birdsong segmentation algorithms. In the presence of additive noise (-10 to 10 dB SNR), the proposed phrase detection system does not degrade as significantly as the other algorithms do.
  • Keywords
    acoustic noise; support vector machines; Cassin´s Vireo recordings; DTW-based segmentation; SVM; additive noise; birdsong-phrase segmentation; class variability; discriminative classifier; dynamic-time-warping-based segmentation; entropy-based birdsong segmentation algorithms; limited training data; low equal error rates; manual annotations; noise-robust template-based approach; outlier rejection; phrase detection system; reliable noise-robust template; segment boundaries; segmenting continuous recording; spectrogram amplitudes; support vector machine; time-frequency region; training spectrograms; verification algorithm; Birds; Classification algorithms; Noise; Noise measurement; Spectrogram; Support vector machines; Training; SVM; bird phrase detection; dynamic time-warping; limited data; noise-robust; template-based;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178071
  • Filename
    7178071