• DocumentCode
    2777876
  • Title

    Linear basis-function t-SNE for fast nonlinear dimensionality reduction

  • Author

    Gisbrecht, A. ; Mokbel, B. ; Hammer, B.

  • Author_Institution
    CITEC Center of Excellence, Univ. of Bielefeld, Bielefeld, Germany
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    t-distributed stochastic neighbor embedding (t-SNE) constitutes a nonlinear dimensionality reduction technique which is particularly suited to visualize high dimensional data sets with intrinsic nonlinear structures. A major drawback, however, consists in its squared complexity which makes the technique infeasible for large data sets or online application in an interactive framework. In addition, since the technique is non parametric, it possesses no direct method to extend the technique to novel data points. In this contribution, we propose an extension of t-SNE to an explicit mapping. In the limit, it reduces to standard non-parametric t-SNE, while offering a feasible nonlinear embedding function for other parameter choices. We evaluate the performance of the technique when trained on a small subpart of the given data only. It turns out that its generalization ability is good when evaluated with the standard quality curve. Further, in many cases, it obtains a quality which approximates the quality of t-SNE when trained on the full data set, albeit only 10% of the data are used for training. This opens the way towards efficient nonlinear dimensionality reduction techniques as required in interactive settings.
  • Keywords
    data visualisation; probability; very large databases; data point; explicit mapping; generalization ability; high dimensional data set visualization; interactive framework; intrinsic nonlinear structure; large data set; linear basis-function t-SNE; nonlinear dimensionality reduction; nonlinear embedding function; probability; squared complexity; standard nonparametric t-SNE; standard quality curve; t-distributed stochastic neighbor embedding; Biological cells; Complexity theory; Cost function; Kernel; Standards; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2012 International Joint Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-1488-6
  • Electronic_ISBN
    2161-4393
  • Type

    conf

  • DOI
    10.1109/IJCNN.2012.6252809
  • Filename
    6252809