• DocumentCode
    3517494
  • Title

    A regularized kernel-based approach to unsupervised audio segmentation

  • Author

    Harchaoui, Zaïd ; Vallet, Félicien ; Lung-Yut-Fong, Alexandre ; Cappé, Olivier

  • Author_Institution
    CNRS, LTCI, TELECOM ParisTech, Paris
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    1665
  • Lastpage
    1668
  • Abstract
    We introduce a regularized kernel-based rule for unsupervised change detection based on a simpler version of the recently proposed kernel fisher discriminant ratio. Compared to other kernel-based change detectors found in the literature, the proposed test statistic is easier to compute and has a known asymptotic distribution which can effectively be used to set the false alarm rate a priori. This technique is applied for segmenting tracks from TV shows, both for segmentation into semantically homogeneous sections (applause, movie, music, etc.) and for speaker diarization within the speech sections. On these tasks, the proposed approach outperforms other kernel-based tests and is competitive with a standard HMM-based supervised alternative.
  • Keywords
    audio signal processing; hidden Markov models; signal detection; hidden Markov model; kernel fisher discriminant ratio; regularized kernel-based approach; speaker diarization; unsupervised audio segmentation; unsupervised change detection; Detectors; Kernel; Motion pictures; Speech; Statistical analysis; Statistical distributions; Streaming media; TV; Telecommunications; Testing; Change detection; audio segmentation; kernel methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4959921
  • Filename
    4959921