Title :
Constructing Classification Model with MapReduce
Author :
Chen, Xiangxiang ; Wu, Kaigui ; Wu, Changze
Author_Institution :
Dept. of Comput. Sci., Chongqing Univ., Chongqing, China
Abstract :
Abstract-By analyzing the process of classification and MapReduce computing paradigms, it is found that the parallel and distributed computing model in MapReduce is appropriate for constructing classifier model. This paper presents a MapReduce algorithm for parallel and distributed classification, aiming to reduce the computational time in training process on large scale documents. Our experiment shows that the running time of the algorithm is greatly shortened and it is capable for larger scale documents.
Keywords :
data mining; parallel programming; pattern classification; text analysis; MapReduce computing; classification model; computational time reduction; distributed computing; larger scale document; parallel computing; Arrays; Classification algorithms; Computational modeling; Data mining; Feature extraction; Training; Training data; Keywords-classification; MapReduce; distributed computing; model; parallel;
Conference_Titel :
Multimedia Information Networking and Security (MINES), 2010 International Conference on
Conference_Location :
Nanjing, Jiangsu
Print_ISBN :
978-1-4244-8626-7
Electronic_ISBN :
978-0-7695-4258-4
DOI :
10.1109/MINES.2010.134