DocumentCode :
2931412
Title :
Audio contributions to semantic video search
Author :
Trancoso ; Pellegrini, T. ; Elo, J. Port ; Meinedo, H. ; Bugalho, M. ; Abad, A. ; Neto, J.
Author_Institution :
INESC-ID Lisboa, Portugal
fYear :
2009
fDate :
June 28 2009-July 3 2009
Firstpage :
630
Lastpage :
633
Abstract :
This paper summarizes the contributions to semantic video search that can be derived from the audio signal. Because of space restrictions, the emphasis will be on non-linguistic cues. The paper thus covers what is generally known as audio segmentation, as well as audio event detection. Using machine learning approaches, we have built detectors for over 50 semantic audio concepts.
Keywords :
audio signal processing; learning (artificial intelligence); search engines; video signal processing; audio contributions; audio event detection; audio segmentation; audio signal; machine learning approaches; nonlinguistic cues; semantic video search; space restrictions; Acoustic signal detection; Detectors; Event detection; Feature extraction; Hidden Markov models; Linear discriminant analysis; Loudspeakers; Machine learning; Principal component analysis; Speech recognition; Audio Event Detection; Audio Segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
ISSN :
1945-7871
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
Type :
conf
DOI :
10.1109/ICME.2009.5202575
Filename :
5202575
Link To Document :
بازگشت