DocumentCode :
439024
Title :
Efficient identification of speakers in news video based on shot segmentation
Author :
Chang, Qing ; Xue, Xiang-Yang ; Lu, Hong ; Nie, You-San
Author_Institution :
Dept. of Comput. Sci. & Eng., Fudan Univ., Shanghai, China
Volume :
2
fYear :
2004
fDate :
6-9 Dec. 2004
Firstpage :
1533
Abstract :
An effective method for speaker identification in news video is presented in this paper, which is based on shot segmentation and exploits both audio and visual cues. Firstly, audio is segmented by shot segmentation based on the observation that there is only one speaker in a shot of news video in most cases. Furthermore, speech/non-speech discrimination is implemented on each shot. Finally, text-independent speaker identification is proposed using audio features on the discriminated speech shots. Experimental results show that our algorithm can obtain satisfactory performance in identifying speakers, so it can be used in real application.
Keywords :
image segmentation; speaker recognition; video signal processing; news video; shot segmentation; speech discrimination; text-independent speaker identification; Bayesian methods; Computer science; Gunshot detection systems; Hidden Markov models; Information retrieval; Loudspeakers; Multimedia computing; Neural networks; Speech; Video compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control, Automation, Robotics and Vision Conference, 2004. ICARCV 2004 8th
Print_ISBN :
0-7803-8653-1
Type :
conf
DOI :
10.1109/ICARCV.2004.1469078
Filename :
1469078
Link To Document :
بازگشت