• DocumentCode
    730071
  • Title

    Estimating double thumbnails for music recordings

  • Author

    Nanzhu Jiang ; Muller, Meinard

  • Author_Institution
    Int. Audio Labs. Erlangen, Erlangen, Germany
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    146
  • Lastpage
    150
  • Abstract
    Audio thumbnailing, which aims at finding the most representative audio segment of a music recording, is an important task in music information retrieval. In general, the notion of a thumbnail is not well-defined and several musical parts may be good thumbnail candidates. For example, for popular music, both a verse and a refrain section may serve as suitable thumbnail candidates. Instead of considering only one thumbnail, we consider in this paper the problem of finding the two most representative segments that correspond to different musical parts. We denote these two segments as double thumbnails. As our main technical contributions, we propose two approaches for computing double thumbnails, both extending a previously introduced repetition-based thumbnailing procedure. In the first approach, which is straightforward, we simply apply the original thumbnailing procedure two times in an iterative fashion. In the second approach, we introduce a novel method for jointly estimating the two thumbnails within one optimization procedure. Finally, we report on experimental results demonstrating the performances of the two double thumbnailing procedures and indicate directions towards full music structure analysis.
  • Keywords
    audio recording; audio signal processing; information retrieval; iterative methods; optimisation; signal representation; audio segment; audio thumbnailing; double thumbnails; full music structure analysis; music information retrieval; music recording; musical parts; optimization procedure; refrain section; repetition-based thumbnailing procedure; representative segments; thumbnail candidates; verse; Acoustics; Audio recording; Estimation; Iterative methods; Joints; Music information retrieval; Optimization; Music; Repetition; Segmentation; Structure; Thumbnailing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177949
  • Filename
    7177949