DocumentCode
476228
Title
A segmentation method of news video stories based on announcer’s voiceprint
Author
Xu, Xin-Wen ; LI, Guo-hui ; Yuan, Jian
Author_Institution
Dept. of Syst. Eng., Nat. Univ. of Defense Technol., Changsha
Volume
5
fYear
2008
fDate
12-15 July 2008
Firstpage
2749
Lastpage
2753
Abstract
As an important step of content based news video retrieving and intelligence mining, semantic unit segmentation has attracted many researcherspsila interests. This paper focuses on a new method of news video stories segmentation which is based on the announcerspsila voiceprints. Firstly, the voiceprints included acoustic perception characteristics of all announcers have been extracted, and its Gaussian mixture model will be trained, then the audio clips included announcers and not announcers will be detected by the KL divergence method, at last the semantic units will be segmented under the guidance of video topic caption frames events and special structure knowledge of news program. Finally the 92.58% recall and the 96.02% precision have been achieved during more than 2 hourspsila experiment.
Keywords
Gaussian processes; content-based retrieval; data mining; image segmentation; video retrieval; video signal processing; Gaussian mixture model; KL divergence method; acoustic perception characteristics; announcer voiceprint; content based news video retrieving; intelligence mining; news video stories segmentation; semantic unit segmentation; Cybernetics; Finance; Hidden Markov models; Information retrieval; Machine learning; Management training; Spectrogram; Support vector machines; Videoconference; Weather forecasting; Gaussian mixture model; News video; Story unit segmentation; Voiceprint;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2095-7
Electronic_ISBN
978-1-4244-2096-4
Type
conf
DOI
10.1109/ICMLC.2008.4620874
Filename
4620874
Link To Document