DocumentCode
2984887
Title
Parallelization with Multiplicative Algorithms for Big Data Mining
Author
Dijun Luo ; Ding, Chibiao ; Heng Huang
Author_Institution
Comput. Sci. & Eng. Dept., Univ. of Texas at Arlington, Arlington, TX, USA
fYear
2012
fDate
10-13 Dec. 2012
Firstpage
489
Lastpage
498
Abstract
We propose a nontrivial strategy to parallelize a series of data mining and machine learning problems, including 1-class and 2-class support vector machines, nonnegative least square problems, and $ell_1$ regularized regression (LASSO) problems. Our strategy fortunately leads to extremely simple multiplicative algorithms which can be straightforwardly implemented in parallel computational environments, such as Map Reduce, or CUDA. We provide rigorous analysis of the correctness and convergence of the algorithm. We demonstrate the scalability and accuracy of our algorithms in comparison with other current leading algorithms.
Keywords
data mining; learning (artificial intelligence); regression analysis; support vector machines; 1-class support vector machine; 2-class support vector machine; CUDA; LASSO problem; Map Reduce; data mining; machine learning problem; multiplicative algorithm; nonnegative least square problem; nontrivial strategy; parallel computational environment; regularized regression; Algorithm design and analysis; Convergence; Data mining; Graphics processing units; Machine learning algorithms; Optimization; Support vector machines; Big Data; CUDA; LASSO; MapReduce; Support Vector Machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location
Brussels
ISSN
1550-4786
Print_ISBN
978-1-4673-4649-8
Type
conf
DOI
10.1109/ICDM.2012.155
Filename
6413876
Link To Document