Title :
A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure
Author :
Taufiq Hasan;Hynek Bořil;Abhijeet Sangwan;John H. L. Hansen
Author_Institution :
Center for Robust Speech Systems (CRSS), University of Texas at Dallas, Richardson, 75080, USA
fDate :
3/1/2012 12:00:00 AM
Abstract :
A generic method for sports video highlight selection is presented in this study. Processing begins where the video is divided into short segments and several multi-modal features are extracted from each video segment. Excitability is computed based on the likelihood of the features lying in certain regions of their probability density functions that are exciting and rare. The proposed measure is used to rank order the partitioned segment stream to compress the overall video sequence and produce a contiguous set of highlights. Experiments are performed on baseball videos using excitement in the commentators´ speech, audio energy, slow motion replay, scene cut density, and motion activity as features. Subjective evaluation of excitability and ranking of video segments yield a higher correlation with the proposed measure compared to well-established techniques indicating the effectiveness of the approach.
Keywords :
"Videos","Feature extraction","Speech","Games","Sports equipment","Motion segmentation","Correlation"
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Print_ISBN :
978-1-4673-0045-2
DOI :
10.1109/ICASSP.2012.6288394