Title :
Towards a unified framework for content-based audio analysis
Author :
Lu, Lie ; Cai, Rui ; Hanjalic, Alan
Author_Institution :
Microsoft Res. Asia, Beijing, China
Abstract :
Audio content analysis is helpful in many multimedia applications. We present a unified framework for content analysis of composite audio. The framework is designed to extract relevant information from different available audio modalities and to discover high-level semantics conveyed by the data. We also demonstrate an implementation of the proposed framework for the detection of scenes and events in various TV shows and movies, in which key audio effects are first extracted as a midlevel representation, and then a Bayesian network is used for high-level semantics inference. Experiments on 12-hour audio data indicate that the proposed framework has a satisfying performance.
Keywords :
audio signal processing; belief networks; inference mechanisms; pattern classification; signal classification; Bayesian network; audio effects extraction; audio modalities; composite audio content analysis; content-based audio analysis; high-level semantics; midlevel representations; unified framework; Data mining; Event detection; Indexing; Layout; Motion pictures; Multimedia databases; Radio broadcasting; Speech; Streaming media; TV;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
Print_ISBN :
0-7803-8874-7
DOI :
10.1109/ICASSP.2005.1415593