DocumentCode :
1872317
Title :
HMM based structuring of tennis videos using visual and audio cues
Author :
Kijak, E. ; Gravier, G. ; Gros, P. ; Oisel, L. ; Bimbot, E.
Author_Institution :
Thomson Multimedia R&D, Cesson Sevigne, France
Volume :
3
fYear :
2003
fDate :
6-9 July 2003
Abstract :
This paper focuses on the use of hidden Markov models (HMMs) for structure analysis of videos, and demonstrates how they can be efficiently applied to merge audio and visual cues. Our approach is validated in the particular domain of tennis videos. The basic temporal unit is the video shot. Visual features describe the audio events within a video shot. The video structure parsing relies on the analysis of the temporal interleaving of video shots, with respect to prior information about tennis content and editing rules. As a result, typical tennis scenes are identified. In addition, each shot is assigned to a level in the hierarchy described in terms of point, game and set.
Keywords :
audio signal processing; hidden Markov models; image classification; image segmentation; multimedia systems; sport; video signal processing; HMM based structuring; audio cues; audio features; hidden Markov models; tennis videos; video structure analysis; video structure parsing; visual cues; visual features; Broadcasting; Data mining; Games; Hidden Markov models; Interleaved codes; Layout; Multimedia communication; Research and development; Streaming media; Videos;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
Type :
conf
DOI :
10.1109/ICME.2003.1221310
Filename :
1221310
Link To Document :
بازگشت