DocumentCode :
3498404
Title :
Personalized video summary using visual semantic annotations and automatic speech transcriptions
Author :
Tseng, Belle L. ; Lin, Ching-Yung
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
fYear :
2002
fDate :
9-11 Dec. 2002
Firstpage :
5
Lastpage :
8
Abstract :
A personalized video summary is dynamically generated in our video personalization and summary system based on user preference and usage environment. The three-tier personalization system adopts the server-middleware-client architecture in order maintain, select, adapt, and deliver rich media content to the user. The server stores the content sources along with their corresponding MPEG-7 metadata descriptions. In this paper, the metadata includes visual semantic annotations and automatic speech transcriptions. Our personalization and summarization engine in the middleware selects the optimal set of desired video segments by matching shot annotations and sentence transcripts with user preferences. The process includes the shot-to-sentence alignment, summary segment selection, and user preference matching and propagation. As a result, the relevant visual shot and audio sentence segments are aggregated and composed into a personalized video summary.
Keywords :
image segmentation; meta data; middleware; semantic networks; speech recognition; video servers; video signal processing; MPEG-7 metadata; audio sentence segments; automatic speech transcriptions; content sources; personalization engines; personalized video summary; server stores; server-middleware-client architecture; shot annotation matching; shot-to-sentence alignment; summarization engines; summary segment selection; three-tier personalization system; user preference matching; user preference propagation; video segments; visual semantic annotations; visual shot; Cellular phones; Content based retrieval; Displays; Engines; Layout; MPEG 7 Standard; Middleware; Network servers; Singular value decomposition; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 2002 IEEE Workshop on
Print_ISBN :
0-7803-7713-3
Type :
conf
DOI :
10.1109/MMSP.2002.1203234
Filename :
1203234
Link To Document :
بازگشت