Title :
Improvement of the Data Mining Algorithm of Rough Set under the Framework of Map/Reduce
Author :
Ying Wang ; Jiqing Liu ; Qiong Liu
Author_Institution :
Beijing CEE Technol. Co., Ltd., Beijing, China
Abstract :
In order to solve the problem that there is a shortage of space and computing power of the traditional spatial data mining algorithm during the processing for massive spatial data information, a combination of Rough set and distributed framework is used in the process of spatial data mining. In this paper, parallel improvement is taken into the algorithm of the traditional Rough set for spatial data mining based on the basic theory of rough set and the Map/Reduce framework, which is efficient and cheap. Then, a spatial data example is utilized to show the feasibility of the improved parallel algorithm. Empirical results show that the improved parallel algorithm of Rough set for spatial data mining can not only effectively improve the efficiency of the algorithm but also meet the need of people to deal with massive spatial data which is hardly to the algorithm of traditional Rough set. Improved Rough set parallel algorithm for spatial data mining can effectively solve the problem of shortage for massive spatial data storage and computing power mining.
Keywords :
data mining; parallel algorithms; parallel programming; rough set theory; MapReduce Framework; data mining algorithm improvement; distributed framework; efficiency improvement; empirical analysis; improved rough set parallel algorithm; massive spatial data storage; massive-spatial data information processing; power mining; spatial data mining algorithm; Algorithm design and analysis; Data mining; Global Positioning System; Parallel algorithms; Programming; Spatial databases; Time complexity; Map/Reduce; Parallelization; Rough Set; Spatial data mining;
Conference_Titel :
Information Science and Cloud Computing Companion (ISCC-C), 2013 International Conference on
Conference_Location :
Guangzhou
DOI :
10.1109/ISCC-C.2013.80