• DocumentCode
    3152835
  • Title
    Nonlinear dimensionality reduction approaches applied to music and textural sounds
  • Author
    Dupont, Samuel ; Ravet, Thierry ; Picard-Limpens, Cecile ; Frisson, Christian
  • Author_Institution
    NUMEDIART Inst. for New Media Art Technol., Univ. of Mons (Belgium), Mons, Belgium
  • fYear
    2013
  • fDate
    15-19 July 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Recently, various dimensionality reduction approaches have been proposed as alternatives to PCA or LDA. These improved approaches do not rely on a linearity assumption and are hence capable of discovering more complex embeddings within different regions of the datasets. Despite their success on artificial datasets, it is not straightforward to predict which technique is most appropriate for a given real dataset. In this paper, we empirically evaluate recent techniques on two real audio use cases: musical instrument loops used in music production and sound effects used in sound editing. ISOMAP and t-SNE are compared with PCA on a visualization problem in which the data are reduced to a two-dimensional view. Several evaluation measures are used: classification performance, as well as trustworthiness and continuity, which assess the preservation of neighborhoods. Although PCA and ISOMAP can yield good continuity even locally (samples that are close in the original space remain close in the low-dimensional one), they fail to preserve the structure of the data well enough to ensure that distinct subgroups remain separate in the visualization. We show that t-SNE achieves the best performance and can even be beneficial as a pre-processing stage for improving classification when the amount of labeled data is low.
  • Keywords
    audio signal processing; information retrieval; multimedia computing; music; musical instruments; principal component analysis; ISOMAP; LDA; PCA; artificial datasets; classification performance; complex embeddings; continuity performance; linearity assumption; music production; musical instrument loops; nonlinear dimensionality reduction; real audio use cases; sound editing; sound effects; t-SNE; textural sounds; visualization problem; Databases; Instruments; Manifolds; Measurement; Music; Principal component analysis; Standards; Dimensionality reduction; audio and music analysis; multimedia information retrieval;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo (ICME), 2013 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    1945-7871
  • Type
    conf
  • DOI
    10.1109/ICME.2013.6607550
  • Filename
    6607550