DocumentCode :
2291606
Title :
Combined audio and visual streams analysis for video sequence segmentation
Author :
Nam, JeHo ; Tewfik, Ahmed H.
Author_Institution :
Dept. of Electr. Eng., Minnesota Univ., Minneapolis, MN, USA
Volume :
4
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
2665
Abstract :
We present a new approach to video sequence segmentation into individual shots. Unlike previous approaches, our technique segments the video sequence by combining two streams of information extracted from the visual track with audio track segmentation information. The visual streams of information are computed from the coarse data in a 3-D wavelet decomposition of the video track. They consist of (i) information derived from temporal edges detected along the time evolution of the intensity of each pixel in temporally sub-sampled spatially filtered coarse frames, and (ii) information derived from the coarse spatio-temporal evolution of intra-frame edges in the spatially filtered coarse frames. Our approach is particularly matched to progressively transmitted video
Keywords :
acoustic signal processing; audio signals; edge detection; image segmentation; image sequences; spatial filters; video signal processing; wavelet transforms; 3-D wavelet decomposition; audio track; coarse data; combined audio visual streams analysis; intensity; intra-frame edges; progressively transmitted video; spatio-temporal evolution; temporal edges; temporally sub-sampled spatially filtered coarse frames; video sequence segmentation; visual track; Content based retrieval; Data mining; Gunshot detection systems; Indexing; Information filtering; Information filters; Speech; Streaming media; TV; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.595337
Filename :
595337
Link To Document :
بازگشت