مرکز منطقه ای اطلاع رساني علوم و فناوري - Towards coherent natural language description of video streams

DocumentCode :

3018183

Title :

Towards coherent natural language description of video streams

Author :

Khan, Muhammad Usman Ghani ; Lei Zhang ; Gotoh, Yusuke

Author_Institution :

Univ. of Sheffield, Sheffield, UK

fYear :

2011

fDate :

6-13 Nov. 2011

Firstpage :

664

Lastpage :

671

Abstract :

This contribution addresses the approach to creating smooth and coherent description of video streams. Firstly conventional image processing techniques are applied to extract high level features from individual video frames. Natural language description of the frame contents is produced based on high level features. In order to extend the approach to description of video streams, we introduce units of features and overview how units can be used to present coherent, smooth and well phrased descriptions by incorporating spatial and temporal information. The approach is evaluated by calculating overlap similarity score between human authored and machine generated descriptions.

Keywords :

feature extraction; image sequences; natural language processing; video streaming; coherent natural language description; high level feature extraction; image processing techniques; spatial information; temporal information; video frames; video streaming; Humans; Image color analysis; Legged locomotion; Natural languages; Streaming media; Video sequences; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on

Conference_Location :

Barcelona

Print_ISBN :

978-1-4673-0062-9

Type :

conf

DOI :

10.1109/ICCVW.2011.6130306

Filename :

6130306

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3018183