Title :
Speaker diarization of heterogeneous web video files: A preliminary study
Author :
Clement, Pierre ; Bazillon, Thierry ; Fredouille, Corinne
Author_Institution :
Lab. Inf. d´´Avignon, Univ. d´´Avignon, Avignon, France
Abstract :
In the last ten years, internet as well as its applications changed significantly, mainly thanks to the raising of available personal resources. Concerning multimedia, the most impressive evolution is the continuous growing success of the video sharing websites. But with this success come the difficulties to efficiently search, index and access relevant information about these documents. Speaker diarization is an important task in the overall information retrieval process. This paper describes an audio/video database, especially built for the speaker diarization task, based on different video genres. Through some preliminary experiments, it highlights the difficulties encountered in this context, mainly linked to the database heterogeneity.
Keywords :
Web sites; multimedia systems; speaker recognition; video retrieval; Internet; heterogeneous Web video files; information retrieval process; multimedia; speaker diarization; video sharing Websites; Databases; Density estimation robust algorithm; Hidden Markov models; Motion pictures; Speech; Speech processing; Streaming media; diarization error rate; heterogeneous web videos; speaker diarization;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947337