DocumentCode
1863109
Title
A fusion scheme of visual and auditory modalities for event detection in sports video
Author
Xu, Min ; Duan, Ling-Yu ; Xu, Chang-Sheng ; Tian, Qi
Author_Institution
Inst. for Infocomm Res., Singapore, Singapore
Volume
1
fYear
2003
fDate
6-9 July 2003
Abstract
In this paper, we propose an effective fusion scheme of visual and auditory modalities to detect events in sports video. The proposed scheme is built upon semantic shot classification, where we classify video shots into several major or interesting classes, each of which has clear semantic meanings. Among major shot classes we perform classification of the different auditory signal segments (i.e. silence, hitting ball, applause, commentator speech) with the goal of detecting events with strong semantic meaning. For instance, for tennis video, we have identified five interesting events: serve, reserve, ace, return, and score. Since we have developed a unified framework for semantic shot classification in sports videos and a set of audio mid-level representation with supervised learning methods, the proposed fusion scheme can be easily adapted to a new sports game. We are extending this fusion scheme to three additional typical sports videos: basketball, volleyball and soccer. Correctly detected sports video events will greatly facilitate further structural and temporal analysis, such as sports video skimming, table of content, etc.
Keywords
audio signal processing; audio-visual systems; sport; video signal processing; audio mid-level representation; auditory modalities; auditory signal segments; event detection; fusion scheme; semantic shot classification; sports video; supervised learning methods; tennis video; visual modalities; Cameras; Event detection; Games; Gunshot detection systems; Hidden Markov models; Indexing; Pattern recognition; Speech; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN
0-7803-7965-9
Type
conf
DOI
10.1109/ICME.2003.1220922
Filename
1220922
Link To Document