• DocumentCode
    1849075
  • Title

    Audio thumbnailing in video sharing sites

  • Author

    Pikrakis, Aggelos

  • Author_Institution
    Dept. of Inf., Univ. of Piraeus, Piraeus, Greece
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    1284
  • Lastpage
    1288
  • Abstract
    This paper presents a variant of the Smith and Waterman algorithm that operates adaptively on a continuous feature space of MPEG-7 low level spectral descriptors and is capable of detecting repeating patterns (thumbnails) in audio streams that stem from shared Internet videos. The proposed method is not restricted to specific audio types and does not rely on training data. It has been studied in the context of four frequently encountered categories of audio streams, including TV shows, cover versions of music tracks, history documentaries and animal sounds. The results are encouraging and indicate that the presented scheme provides, in the general case, meaningful thumbnails and exhibits acceptable robustness with respect to audio recording quality.
  • Keywords
    Internet; audio recording; audio signal processing; video coding; MPEG-7 low level spectral descriptors; Smith algorithm; Waterman algorithm; audio recording quality; audio thumbnailing; continuous feature space; shared Internet videos; video sharing sites; Animals; Context; Feature extraction; Music; Speech; TV; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
  • Conference_Location
    Bucharest
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type

    conf

  • Filename
    6333942