Title :
Exploring audio semantic concepts for event-based video retrieval
Author :
Yipei Wang ; Rawat, Seema ; Metze, Florian
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
The audio semantic concepts (sound events) play important roles in audio-based content analysis. How to capture the semantic information effectively from the complex occurrence pattern of sound events in YouTube quality videos is a challenging problem. This paper presents a novel framework to handle the complex situation for semantic information extraction in real-world videos and evaluate through the NIST multimedia event detection task (MED). We calculate the occurrence confidence matrix of sound events and explore multiple strategies to generate clip-level semantic features from the matrix. We evaluate the performance using TRECVID2011 MED dataset. The proposed method outperforms previous HMM-based system. The late fusion experiment with the low-level features and text feature (ASR) shows that audio semantic concepts capture complementary information in the soundtrack.
Keywords :
audio signal processing; matrix algebra; video retrieval; HMM-based system; NIST multimedia event detection task; TRECVID2011 MED dataset; YouTube quality videos; audio semantic concepts; audio-based content analysis; clip-level semantic features; complex occurrence pattern; event-based video retrieval; low-level features; occurrence confidence matrix; semantic information; sound events; text feature; Event detection; Feature extraction; Hidden Markov models; Multimedia communication; Semantics; Speech; Vectors; audio processing; multimedia retrieval; semantic concept;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853819