• DocumentCode
    177640
  • Title

    Music segment similarity using 2D-Fourier Magnitude Coefficients

  • Author

    Nieto, Oriol ; Bello, Juan P.

  • Author_Institution
    Music & Audio Res. Lab., New York Univ., New York, NY, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    664
  • Lastpage
    668
  • Abstract
    Music segmentation is the task of automatically identifying the different segments of a piece. In this work we present a novel approach to cluster the musical segments based on their acoustic similarity by using 2D-Fourier Magnitude Coefficients (2D-FMCs). These coefficients, computed from a chroma representation, significantly simplify the problem of clustering the different segments since they are key transposition and phase shift invariant. We explore various strategies to obtain the 2D-FMC patches that represent entire segments and apply k-means to label them. Finally, we discuss possible ways of estimating k and compare our competitive results with the current state of the art.
  • Keywords
    Fourier transforms; acoustic signal processing; audio signal processing; learning (artificial intelligence); music; pattern clustering; signal representation; 2D-FMC; 2D-Fourier magnitude coefficients; acoustic similarity; chroma representation; k-means algorithm; music segment similarity; music segmentation; segment clustering; Multiple signal classification; Music; Music information retrieval; Speech; Speech processing; Vectors; 2D-Fourier Transform; Clustering; Music Segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853679
  • Filename
    6853679