Title :
Combined audio and visual streams analysis for video sequence segmentation
Author :
Nam, JeHo ; Tewfik, Ahmed H.
Author_Institution :
Dept. of Electr. Eng., Minnesota Univ., Minneapolis, MN, USA
Abstract :
We present a new approach to video sequence segmentation into individual shots. Unlike previous approaches, our technique segments the video sequence by combining two streams of information extracted from the visual track with audio track segmentation information. The visual streams of information are computed from the coarse data in a 3-D wavelet decomposition of the video track. They consist of (i) information derived from temporal edges detected along the time evolution of the intensity of each pixel in temporally sub-sampled spatially filtered coarse frames, and (ii) information derived from the coarse spatio-temporal evolution of intra-frame edges in the spatially filtered coarse frames. Our approach is particularly matched to progressively transmitted video
Keywords :
acoustic signal processing; audio signals; edge detection; image segmentation; image sequences; spatial filters; video signal processing; wavelet transforms; 3-D wavelet decomposition; audio track; coarse data; combined audio visual streams analysis; intensity; intra-frame edges; progressively transmitted video; spatio-temporal evolution; temporal edges; temporally sub-sampled spatially filtered coarse frames; video sequence segmentation; visual track; Content based retrieval; Data mining; Gunshot detection systems; Indexing; Information filtering; Information filters; Speech; Streaming media; TV; Video sequences;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.595337