Title :
Interactive video retrieval using embedded audio content
Author :
Amin, Tanvir ; Guan, L.
Author_Institution :
Dept. of Electr. & Comput. Eng., Ryerson Univ., Toronto, Ont., Canada
Abstract :
Audio is a rich source of information in the digital videos that can provide useful descriptors for indexing the video databases. In this paper, we model the shape of the distribution of wavelet coefficients of embedded audio with a Laplacian mixture. The distributions of wavelet coefficients are very peaky in nature. The shape of these distributions can be modeled with only two components in the Laplacian mixture with low computational complexity. The parameters of this mixture model form a low dimensional feature vector representing global similarity of the audio content of the video clips. An interactive approach involving the feature vector updating scheme is used to adapt the retrieval system to the users´ needs. This relevance feedback (RF) increases the retrieval ratio substantially. A comprehensive experimental evaluation using the CNN news database has been performed.
Keywords :
content-based retrieval; database indexing; feature extraction; image retrieval; interactive systems; multimedia databases; relevance feedback; video databases; wavelet transforms; CNN news database; Laplacian mixture; audio content; digital videos; distribution shape; embedded audio content; feature vector; feature vector updating; global similarity representation; interactive video retrieval; relevance feedback; retrieval ratio; video clips; video database indexing; wavelet coefficients; Audio databases; Computational complexity; Content based retrieval; Feedback; Indexing; Information resources; Laplace equations; Radio frequency; Shape; Wavelet coefficients;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326578