DocumentCode :
431598
Title :
Towards a unified framework for content-based audio analysis
Author :
Lu, Lie ; Cai, Rui ; Hanjalic, Alan
Author_Institution :
Microsoft Res. Asia, Beijing, China
Volume :
2
fYear :
2005
fDate :
18-23 March 2005
Abstract :
Audio content analysis is helpful in many multimedia applications. We present a unified framework for content analysis of composite audio. The framework is designed to extract relevant information from different available audio modalities and to discover high-level semantics conveyed by the data. We also demonstrate an implementation of the proposed framework for the detection of scenes and events in various TV shows and movies, in which key audio effects are first extracted as a midlevel representation, and then a Bayesian network is used for high-level semantics inference. Experiments on 12-hour audio data indicate that the proposed framework has a satisfying performance.
Keywords :
audio signal processing; belief networks; inference mechanisms; pattern classification; signal classification; Bayesian network; audio effects extraction; audio modalities; composite audio content analysis; content-based audio analysis; high-level semantics; midlevel representations; unified framework; Data mining; Event detection; Indexing; Layout; Motion pictures; Multimedia databases; Radio broadcasting; Speech; Streaming media; TV;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8874-7
Type :
conf
DOI :
10.1109/ICASSP.2005.1415593
Filename :
1415593
Link To Document :
بازگشت