Title :
Enhanced video browsing using automatically extracted audio excerpts
Author :
Foote, Jonathan ; Cooper, M. ; Wilcox, Lynn
Author_Institution :
FX Palo Alto Lab., CA, USA
Abstract :
We present a method for rapidly and robustly extracting audio excerpts without the overhead of speech recognition or speaker segmentation. An immediate application is to automatically augment keyframe-based video summaries with informative audio excerpts associated with the video segments represented by the keyframes. Short audio clips combined with keyframes comprise an extremely lightweight and Web-browsable interface for auditioning video or similar media, without using bandwidth-intensive streaming video or audio.
Keywords :
audio signal processing; feature extraction; image retrieval; image segmentation; video databases; video signal processing; Manga system; Web-browsable interface; audio clips; audio excerpts; automatically augment keyframe-based video summaries; automatically extracted audio excerpts; bandwidth-intensive audio streaming; bandwidth-intensive video streaming; keyframes; multimedia browsing; multimedia documents; video browsing; video media auditioning; video segmentation; video segments; Acoustic noise; Ink; Laboratories; Navigation; Noise robustness; Speech analysis; Speech enhancement; Speech recognition; Streaming media;
Conference_Titel :
Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7803-7304-9
DOI :
10.1109/ICME.2002.1035604