• DocumentCode
    2984887
  • Title

    Parallelization with Multiplicative Algorithms for Big Data Mining

  • Author

    Dijun Luo ; Ding, Chibiao ; Heng Huang

  • Author_Institution
    Comput. Sci. & Eng. Dept., Univ. of Texas at Arlington, Arlington, TX, USA
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    489
  • Lastpage
    498
  • Abstract
    We propose a nontrivial strategy to parallelize a series of data mining and machine learning problems, including 1-class and 2-class support vector machines, nonnegative least square problems, and $ell_1$ regularized regression (LASSO) problems. Our strategy fortunately leads to extremely simple multiplicative algorithms which can be straightforwardly implemented in parallel computational environments, such as Map Reduce, or CUDA. We provide rigorous analysis of the correctness and convergence of the algorithm. We demonstrate the scalability and accuracy of our algorithms in comparison with other current leading algorithms.
  • Keywords
    data mining; learning (artificial intelligence); regression analysis; support vector machines; 1-class support vector machine; 2-class support vector machine; CUDA; LASSO problem; Map Reduce; data mining; machine learning problem; multiplicative algorithm; nonnegative least square problem; nontrivial strategy; parallel computational environment; regularized regression; Algorithm design and analysis; Convergence; Data mining; Graphics processing units; Machine learning algorithms; Optimization; Support vector machines; Big Data; CUDA; LASSO; MapReduce; Support Vector Machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.155
  • Filename
    6413876