DocumentCode
2805833
Title
Hierarchical language modeling for audio events detection in a sports game
Author
Huang, Qiang ; Cox, Stephen
Author_Institution
Sch. of Comput. Sci., Univ. of East Anglia, Norwich, UK
fYear
2010
fDate
14-19 March 2010
Firstpage
2286
Lastpage
2289
Abstract
We investigate the automatic labelling of “events” from an audio recording of a sports game. We describe a technique that utilises a hierarchy of language models, which are a low-level model of acoustic observations and a high-level model of audio events that occur during a game: these models are integrated using a maximum entropy approach. Our models of the audio events also utilise duration and voicing information as well as spectral content, and we show that further discrimination between events is possible using these features. Results on different tennis games show that the use of these techniques is better than using an approach that does not use modelling of dependencies between frames and events or extra information in the form of duration and voicing.
Keywords
acoustic signal detection; audio recording; entropy; sport; acoustic observations; audio events detection; audio recording; hierarchical language modeling; maximum entropy; sports game; Audio recording; Entropy; Event detection; Games; Hidden Markov models; Information resources; Labeling; Natural languages; Speech recognition; Support vector machines; Audio Event Detection; Language Modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495935
Filename
5495935
Link To Document