Title :
Image and audio sequence visualization and interaction mechanisms for structured video browsing and editing
Author :
Toklu, Candemir ; Liou, Shih-Ping
Author_Institution :
Dept. of Multimedia & Video Technology, Siemens Corporate Research, Inc., Princeton, NJ, USA
Abstract :
We discuss extensions to our video browsing tool described by Hjelsvold et al. (see Handbook of Internet and Multimedia Systems and Applications, 1998). We propose adding visual representations of audio information to the tool: the audio track of a video is represented by its spectrogram image and pitch curve, which enriches the video- and audio-related information available to the user. This representation also makes it easier to correct automatically computed audio event boundaries and to introduce speaker segments. We also provide a real-time approach for segmenting audio into three event types (silence, speech, and non-speech) to further enrich the audio information space.
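As a rough illustration of the audio representation mentioned in the abstract, the following minimal sketch computes a magnitude spectrogram and a per-frame silence flag for a mono signal. It is not the authors' method: the abstract does not specify their algorithm, and the function names (`frame_signal`, `spectrogram`, `silence_mask`), the frame length, the hop size, and the -40 dB energy threshold are all assumptions chosen for the example. The sketch separates only silence from non-silence; it does not attempt the speech versus non-speech distinction or the pitch curve.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames (one frame per row)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def spectrogram(x, frame_len=512, hop=256):
    """Magnitude spectrogram in dB, shape (n_frames, frame_len // 2 + 1)."""
    # Hann window reduces spectral leakage between neighbouring bins.
    frames = frame_signal(x, frame_len, hop) * np.hanning(frame_len)
    mag = np.abs(np.fft.rfft(frames, axis=1))
    return 20.0 * np.log10(mag + 1e-10)  # small offset avoids log(0)

def silence_mask(x, frame_len=512, hop=256, threshold_db=-40.0):
    """True for frames whose RMS level falls below threshold_db.

    The threshold is a hypothetical value relative to a full-scale
    amplitude of 1.0, not a figure taken from the paper.
    """
    frames = frame_signal(x, frame_len, hop)
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    return 20.0 * np.log10(rms + 1e-10) < threshold_db

# Synthetic test input: one second of silence followed by a 440 Hz tone.
sr = 16000
t = np.arange(sr) / sr
tone = 0.5 * np.sin(2 * np.pi * 440.0 * t)
signal = np.concatenate([np.zeros(sr), tone])

spec = spectrogram(signal)   # image-like array a browser could display
mask = silence_mask(signal)  # per-frame silence flags
```

Rendering `spec` as an image (time on one axis, frequency on the other) gives the kind of spectrogram view the abstract describes, and runs of `True` in `mask` mark candidate silence events whose boundaries a user could then adjust.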
Keywords :
audio signals; image representation; image sequences; online front-ends; spectral analysis; video signal processing; audio event boundaries; audio related information enhancement; audio segmentation; audio sequence visualization; audio track representation; image sequence visualization; interaction mechanisms; non-speech event; pitch curve; real-time approach; silence; speaker segments; spectrogram image; speech event; structured video browsing; structured video editing; video browsing tool; video related information enhancement; Bandwidth; Computer networks; Educational institutions; Humans; Image segmentation; Image sequences; Spectrogram; Speech analysis; Speech enhancement; Visualization;
Conference_Title :
Image Processing, 2000. Proceedings. 2000 International Conference on
Conference_Location :
Vancouver, BC, Canada
Print_ISBN :
0-7803-6297-7
DOI :
10.1109/ICIP.2000.899296