Title :
Temporal events in all dimensions and scales
Author :
Slaney, Malcolm ; Ponceleon, Dulce ; Kaufman, James
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Abstract :
This paper describes a new representation for the audio and visual information in a video signal. We reduce the dimensionality of the signals with singular-value decomposition (SVD) or mel-frequency cepstral coefficients (MFCC). We apply these transforms to word (word transcript, semantic space or latent semantic indexing), image (color histogram data) and audio (timbre) data. Using scale-space techniques, we find large jumps in a video's path, which are evidence for events. We use these techniques to analyze the temporal properties of the audio and image data in a video. This analysis creates a hierarchical segmentation of the video, or a table of contents, from both the audio and the image data.
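The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes per-frame feature vectors (for example, color histograms) are already available as a NumPy array, and it takes "scale-space techniques" to mean filtering the reduced-dimension path with a derivative-of-Gaussian kernel at a chosen scale `sigma`. The function names, the relative threshold, and the peak-picking rule are all hypothetical choices made for illustration.

```python
import numpy as np


def reduce_dim(features, k):
    """Project frames (rows) onto the top-k right singular vectors (SVD)."""
    centered = features - features.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:k].T


def gaussian_derivative_kernel(sigma):
    """First derivative of a unit-sum Gaussian, sampled on integer offsets."""
    radius = int(np.ceil(4 * sigma))
    t = np.arange(-radius, radius + 1)
    g = np.exp(-t**2 / (2 * sigma**2))
    g /= g.sum()
    return -t / sigma**2 * g


def jump_strength(path, sigma):
    """Magnitude of the smoothed temporal derivative of the path at scale sigma."""
    kernel = gaussian_derivative_kernel(sigma)
    radius = len(kernel) // 2
    columns = []
    for d in range(path.shape[1]):
        # Repeat the edge values so the sequence boundaries do not look like jumps.
        padded = np.pad(path[:, d], radius, mode="edge")
        columns.append(np.convolve(padded, kernel, mode="valid"))
    return np.linalg.norm(np.stack(columns, axis=1), axis=1)


def find_events(path, sigma, rel_threshold=0.5):
    """Frame indices where jump strength is a local peak above a relative threshold.

    The threshold (a fraction of the maximum strength) is an assumption of this sketch.
    """
    s = jump_strength(path, sigma)
    mid = s[1:-1]
    is_peak = (mid > s[:-2]) & (mid >= s[2:]) & (mid > rel_threshold * s.max())
    return np.arange(1, len(s) - 1)[is_peak]


def events_by_scale(path, sigmas, rel_threshold=0.5):
    """Run the detector at several scales; larger sigma smooths away short-lived changes."""
    return {sigma: find_events(path, sigma, rel_threshold) for sigma in sigmas}
```

Calling `events_by_scale(reduce_dim(features, k), sigmas)` with a range of scales yields one candidate boundary list per scale, from which a coarse-to-fine segmentation could be assembled. How the paper itself links boundaries across scales is not stated in the abstract.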
Keywords :
audio signal processing; cepstral analysis; image colour analysis; image segmentation; signal representation; singular value decomposition; video signal processing; SVD; audio data; audio information representation; color histogram data; hierarchical video segmentation; image data; latent semantic indexing; mel-frequency cepstral coefficients; scale-space techniques; semantic space; signal dimensionality reduction; singular-value decomposition; table-of-contents; temporal events; temporal properties; timbre; transforms; video signal; visual information representation; word transcript; Algorithm design and analysis; Cepstral analysis; Event detection; Histograms; Image analysis; Image edge detection; Image segmentation; Indexing; Mel frequency cepstral coefficient; Timbre;
Conference_Title :
Detection and Recognition of Events in Video, 2001. Proceedings. IEEE Workshop on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7695-1293-3
DOI :
10.1109/EVENT.2001.938870