DocumentCode :
1849075
Title :
Audio thumbnailing in video sharing sites
Author :
Pikrakis, Aggelos
Author_Institution :
Dept. of Inf., Univ. of Piraeus, Piraeus, Greece
fYear :
2012
fDate :
27-31 Aug. 2012
Firstpage :
1284
Lastpage :
1288
Abstract :
This paper presents a variant of the Smith and Waterman algorithm that operates adaptively on a continuous feature space of MPEG-7 low level spectral descriptors and is capable of detecting repeating patterns (thumbnails) in audio streams that stem from shared Internet videos. The proposed method is not restricted to specific audio types and does not rely on training data. It has been studied in the context of four frequently encountered categories of audio streams, including TV shows, cover versions of music tracks, history documentaries and animal sounds. The results are encouraging and indicate that the presented scheme provides, in the general case, meaningful thumbnails and exhibits acceptable robustness with respect to audio recording quality.
Keywords :
Internet; audio recording; audio signal processing; video coding; MPEG-7 low level spectral descriptors; Smith algorithm; Waterman algorithm; audio recording quality; audio thumbnailing; continuous feature space; shared Internet videos; video sharing sites; Animals; Context; Feature extraction; Music; Speech; TV; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location :
Bucharest
ISSN :
2219-5491
Print_ISBN :
978-1-4673-1068-0
Type :
conf
Filename :
6333942
Link To Document :
بازگشت