DocumentCode
1849075
Title
Audio thumbnailing in video sharing sites
Author
Pikrakis, Aggelos
Author_Institution
Dept. of Inf., Univ. of Piraeus, Piraeus, Greece
fYear
2012
fDate
27-31 Aug. 2012
Firstpage
1284
Lastpage
1288
Abstract
This paper presents a variant of the Smith and Waterman algorithm that operates adaptively on a continuous feature space of MPEG-7 low level spectral descriptors and is capable of detecting repeating patterns (thumbnails) in audio streams that stem from shared Internet videos. The proposed method is not restricted to specific audio types and does not rely on training data. It has been studied in the context of four frequently encountered categories of audio streams, including TV shows, cover versions of music tracks, history documentaries and animal sounds. The results are encouraging and indicate that the presented scheme provides, in the general case, meaningful thumbnails and exhibits acceptable robustness with respect to audio recording quality.
Keywords
Internet; audio recording; audio signal processing; video coding; MPEG-7 low level spectral descriptors; Smith algorithm; Waterman algorithm; audio recording quality; audio thumbnailing; continuous feature space; shared Internet videos; video sharing sites; Animals; Context; Feature extraction; Music; Speech; TV; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location
Bucharest
ISSN
2219-5491
Print_ISBN
978-1-4673-1068-0
Type
conf
Filename
6333942
Link To Document