Title :
Video classification using transform coefficients
Author :
Girgensohn, A. ; Foote, Jonothnn
Author_Institution :
FX Palo Alto Lab., CA, USA
Abstract :
This paper describes techniques for classifying video frames using statistical models of reduced DCT or Hadamard transform coefficients. When decimated in time and reduced using truncation or principal component analysis, transform coefficients taken across an entire frame image allow rapid modeling, segmentation and similarity calculation. Unlike color-histogram metrics, this approach models image composition and works on grayscale images. Modeling the statistics of the transformed video frame images gives a likelihood measure that allows video to be segmented, classified, and ranked by similarity for retrieval. Experiments are presented that show an 87% correct classification rate for different classes. Applications are presented including a content-aware video browser
Keywords :
Hadamard transforms; content-based retrieval; discrete cosine transforms; image classification; image segmentation; principal component analysis; video signal processing; DCT; Hadamard transform; content-aware video browser; grayscale images; image composition; likelihood measure; principal component analysis; rapid modeling; segmentation; similarity; statistical models; transform coefficients; truncation; video classification; video frames; Discrete cosine transforms; Histograms; Image analysis; Image color analysis; Image retrieval; Image segmentation; Pixel; Principal component analysis; Testing; Video compression;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.757483