• DocumentCode
    1667744
  • Title

    Online Nonlinear Classification for High-Dimensional Data

  • Author

    Vanli, N. Denizcan ; Ozkan, Huseyin ; Delibalta, Ibrahim ; Kozat, Suleyman S.

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Bilkent Univ., Ankara, Turkey
  • fYear
    2015
  • Firstpage
    685
  • Lastpage
    688
  • Abstract
    We study online binary classification problem under the empirical zero-one loss function. We introduce a novel randomized classification algorithm based on highly dynamic hierarchical models that partition the feature space. Our approach jointly and sequentially learns the partitioning of the feature space, the optimal classifier among all doubly exponential number of classifiers defined by the tree, and the individual region classifiers in order to directly minimize the cumulative loss. Although we adapt the entire hierarchical model to minimize a global loss function, the computational complexity of the introduced algorithm scales linearly with the dimensionality of the feature space and the depth of the tree. Furthermore, our algorithm can be applied to any streaming data without requiring a training phase or prior information, hence processes data on-the-fly and then discards it, which makes the introduced algorithm significantly appealing for applications involving "big data". We evaluate the performance of the introduced algorithm over different real data sets.
  • Keywords
    Big Data; computational complexity; minimisation; pattern classification; randomised algorithms; trees (mathematics); big data; computational complexity; empirical zero-one loss function; feature space dimensionality; global loss function minimization; high-dimensional data; highly dynamic hierarchical models; online binary classification problem; online nonlinear classification; randomized classification algorithm; streaming data; trees; Big data; Boosting; Computational complexity; Heuristic algorithms; Partitioning algorithms; Signal processing algorithms; high-dimensional data; nonlinear classification; online classification; randomized algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (BigData Congress), 2015 IEEE International Congress on
  • Conference_Location
    New York, NY
  • Print_ISBN
    978-1-4673-7277-0
  • Type

    conf

  • DOI
    10.1109/BigDataCongress.2015.109
  • Filename
    7207293