DocumentCode :
1593858
Title :
Video semantic mining and annotation
Author :
Zhang, Shilin
Author_Institution :
Computer Faculty, North China University of Technology, Beijing, China
fYear :
2012
Firstpage :
1
Lastpage :
3
Abstract :
Speech signal, video caption text and video frame images are all key factors for a person to understand the video content. Through above observation, we bring forward a scheme which integrating continuous speech recognition, video caption text recognition and object recognition. The video is firstly segmented into a number of shots by shot detection. The object recognition results are also presented in the same way. The above three folds of texts are processed by part of speech and stemming and finally are represented by three bags of words. Only the noun and verb words are kept. The words are further depicted as a graph. The graph vertices stand for the words and the edges denote the semantic relation between two neighboring words. In the last step, we apply the dense sub graph finding method to mine the video semantic meaning. Experiments show that our video semantic mining method is efficient.
Keywords :
Auto Speech Recognition; Information Fusion; Object Recognition; Video Semantic Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
World Automation Congress (WAC), 2012
Conference_Location :
Puerto Vallarta, Mexico
ISSN :
2154-4824
Print_ISBN :
978-1-4673-4497-5
Type :
conf
Filename :
6321812
Link To Document :
بازگشت