DocumentCode :
270509
Title :
The importance of audio descriptors in automatic soccer highlights generation
Author :
Raventós, Arnau ; Quijada, Raul ; Torres, L. ; Tarrés, Francesc ; Carasusán, Eusebio ; Giribet, Daniel
Author_Institution :
Signal Theor. & Commun. Dept., UPC - Barcelona Tech., Barcelona, Spain
fYear :
2014
fDate :
11-14 Feb. 2014
Firstpage :
1
Lastpage :
6
Abstract :
Automatic generation of sports highlights from recorded audiovisual content has been object of great interest in recent years. The problem is indeed important in the production of second and third division leagues highlights videos where the quantity of raw material is significant and does not contain manual annotations. Many approaches are mostly based on the analysis of the video and disregard the important information provided by the audio track. In this paper, a new approach that combines audio and video descriptors for automatic soccer highlights generation is proposed. The approach is based on the segmentation of the video contents into shots that are further analyzed in order to determine its relevance and interest. These video-shots are scored taking into account the fusion between different audio and video features. The paper is mainly focused to emphasize the importance of audio detectors that play a key role in the analysis and scoring of the video-shots. Specifically, a new algorithm for referee´s whistle detection is proposed. The algorithm has been proven to be very robust and efficiently discriminates professional whistles against other types of noises such as public cheering-up, music instruments, etc. Several results have been produced using real soccer video sequences that prove the validity of the proposed audio and video fusion scheme.
Keywords :
acoustic signal detection; audio signal processing; feature extraction; image segmentation; sport; video signal processing; audio descriptors; audio detectors; audio features; audio track; automatic soccer highlights generation; leagues highlights videos; manual annotations; music instruments; professional whistles; public cheering-up; recorded audiovisual content; referee whistle detection; soccer video sequences; sports highlights; video descriptors; video features; video segmentation; video shots; System-on-chip; XML; audio descriptors; content analysis; multimodal processing and fusion; semantic detection; video highlights; whistle detector;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multi-Conference on Systems, Signals & Devices (SSD), 2014 11th International
Conference_Location :
Barcelona
Type :
conf
DOI :
10.1109/SSD.2014.6808845
Filename :
6808845
Link To Document :
بازگشت