DocumentCode
2931412
Title
Audio contributions to semantic video search
Author
Trancoso, I. ; Pellegrini, T. ; Portelo, J. ; Meinedo, H. ; Bugalho, M. ; Abad, A. ; Neto, J.
Author_Institution
INESC-ID Lisboa, Portugal
fYear
2009
fDate
June 28 2009-July 3 2009
Firstpage
630
Lastpage
633
Abstract
This paper summarizes the contributions to semantic video search that can be derived from the audio signal. Because of space restrictions, the emphasis will be on non-linguistic cues. The paper thus covers what is generally known as audio segmentation, as well as audio event detection. Using machine learning approaches, we have built detectors for over 50 semantic audio concepts.
Keywords
audio signal processing; learning (artificial intelligence); search engines; video signal processing; audio contributions; audio event detection; audio segmentation; audio signal; machine learning approaches; nonlinguistic cues; semantic video search; space restrictions; Acoustic signal detection; Detectors; Event detection; Feature extraction; Hidden Markov models; Linear discriminant analysis; Loudspeakers; Machine learning; Principal component analysis; Speech recognition; Audio Event Detection; Audio Segmentation;
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location
New York, NY
ISSN
1945-7871
Print_ISBN
978-1-4244-4290-4
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2009.5202575
Filename
5202575