DocumentCode :
693410
Title :
PLAR: Parallel Large-Scale Attribute Reduction on Cloud Systems
Author :
Junbo Zhang ; Tianrui Li ; Yi Pan
Author_Institution :
Sch. of Inf. Sci. & Technol., Southwest Jiaotong Univ., Chengdu, China
fYear :
2013
fDate :
16-18 Dec. 2013
Firstpage :
184
Lastpage :
191
Abstract :
Attribute reduction for big data is viewed as an important preprocessing step in the areas of pattern recognition, machine learning and data mining. In this paper, a novel parallel method based on MapReduce for large-scale attribute reduction is proposed. By using this method, several representative heuristic attribute reduction algorithms in rough set theory have been parallelized. Further, each of the improved parallel algorithms can select the same attribute reduct as its sequential version, therefore, owns the same classification accuracy. An extensive experimental evaluation shows that these parallel algorithms are effective for big data.
Keywords :
Big Data; cloud computing; data mining; learning (artificial intelligence); parallel algorithms; pattern recognition; rough set theory; MapReduce; PLAR; big data; cloud systems; data mining; heuristic attribute reduction algorithms; machine learning; parallel algorithms; parallel large-scale attribute reduction; pattern recognition; rough set theory; Acceleration; Approximation methods; Big data; Entropy; Machine learning algorithms; Parallel algorithms; Set theory; Attribute Reduction; Big Data; MapReduce; Rough Set Theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2013 International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4799-2418-9
Type :
conf
DOI :
10.1109/PDCAT.2013.36
Filename :
6904253
Link To Document :
بازگشت