DocumentCode :
2173562
Title :
Speaker diarization of heterogeneous web video files: A preliminary study
Author :
Clement, Pierre ; Bazillon, Thierry ; Fredouille, Corinne
Author_Institution :
Lab. Inf. d´´Avignon, Univ. d´´Avignon, Avignon, France
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4432
Lastpage :
4435
Abstract :
In the last ten years, internet as well as its applications changed significantly, mainly thanks to the raising of available personal resources. Concerning multimedia, the most impressive evolution is the continuous growing success of the video sharing websites. But with this success come the difficulties to efficiently search, index and access relevant information about these documents. Speaker diarization is an important task in the overall information retrieval process. This paper describes an audio/video database, especially built for the speaker diarization task, based on different video genres. Through some preliminary experiments, it highlights the difficulties encountered in this context, mainly linked to the database heterogeneity.
Keywords :
Web sites; multimedia systems; speaker recognition; video retrieval; Internet; heterogeneous Web video files; information retrieval process; multimedia; speaker diarization; video sharing Websites; Databases; Density estimation robust algorithm; Hidden Markov models; Motion pictures; Speech; Speech processing; Streaming media; diarization error rate; heterogeneous web videos; speaker diarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947337
Filename :
5947337
Link To Document :
بازگشت