Title :
A Parallel Spatial Co-location Mining Algorithm Based on MapReduce
Author :
Yoo, Jin Soung ; Boulware, Douglas ; Kimmey, David
Author_Institution :
Dept. of Comput. Sci., Indiana Univ.-Purdue Univ., Fort Wayne, IN, USA
Date :
June 27, 2014 - July 2, 2014
Abstract :
Spatial association rule mining is a useful tool for discovering correlations and interesting relationships among spatial objects. Co-locations, sets of spatial events that are frequently observed together in close proximity, are particularly useful for discovering spatial dependencies among those events. Although a number of spatial co-location mining algorithms have been developed, co-location pattern discovery remains prohibitively expensive for large data sets and dense neighborhoods. We propose to leverage parallel processing, in particular the MapReduce framework, to make spatial mining more efficient. MapReduce-like systems have proven to be an efficient framework for large-scale data processing on clusters of commodity machines and for big data analysis in many applications. The proposed parallel co-location mining algorithm was developed on MapReduce. Experimental results show that the developed algorithm is scalable in computational performance.
Keywords :
Big Data; data mining; parallel programming; Big Data analysis; MapReduce framework; co-location pattern discovery; commodity machine clusters; computational performance; large-scale data processing; parallel processing; parallel spatial co-location mining algorithm; spatial association rule mining; spatial dependency discovery; spatial event sets; spatial mining processing efficiency; spatial objects; Big data; Data mining; Distributed databases; Indexes; Parallel processing; Partitioning algorithms; Spatial databases; MapReduce; cloud computing; co-location pattern; spatial association analysis; spatial data mining;
Conference_Title :
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5056-0
DOI :
10.1109/BigData.Congress.2014.14