DocumentCode
3517494
Title
A regularized kernel-based approach to unsupervised audio segmentation
Author
Harchaoui, Zaïd ; Vallet, Félicien ; Lung-Yut-Fong, Alexandre ; Cappé, Olivier
Author_Institution
CNRS, LTCI, TELECOM ParisTech, Paris
fYear
2009
fDate
19-24 April 2009
Firstpage
1665
Lastpage
1668
Abstract
We introduce a regularized kernel-based rule for unsupervised change detection based on a simpler version of the recently proposed kernel fisher discriminant ratio. Compared to other kernel-based change detectors found in the literature, the proposed test statistic is easier to compute and has a known asymptotic distribution which can effectively be used to set the false alarm rate a priori. This technique is applied for segmenting tracks from TV shows, both for segmentation into semantically homogeneous sections (applause, movie, music, etc.) and for speaker diarization within the speech sections. On these tasks, the proposed approach outperforms other kernel-based tests and is competitive with a standard HMM-based supervised alternative.
Keywords
audio signal processing; hidden Markov models; signal detection; hidden Markov model; kernel fisher discriminant ratio; regularized kernel-based approach; speaker diarization; unsupervised audio segmentation; unsupervised change detection; Detectors; Kernel; Motion pictures; Speech; Statistical analysis; Statistical distributions; Streaming media; TV; Telecommunications; Testing; Change detection; audio segmentation; kernel methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4959921
Filename
4959921
Link To Document