DocumentCode :
3298974
Title :
Enumeration of maximal clique for mining spatial co-location patterns
Author :
Al-Naymat, Ghazi
Author_Institution :
Univ. of Sydney, Sydney
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
126
Lastpage :
133
Abstract :
This paper presents a systematic approach to mine co- location patterns in Sloan Digital Sky Survey (SDSS) data. SDSS Data Release 5 (DR5) contains 3.6 TB of data. Availability of such large amount of useful data is an opportunity for application of data mining techniques to generate interesting information. The major reason for the lack of such data mining applications in SDSS is the unavailability of data in a suitable format. This work illustrates a procedure to obtain additional galaxy types from an available attributes and transform the data into maximal cliques of galaxies which in turn can be used as transactions for data mining applications. An efficient algorithm GridClique is proposed to generate maximal cliques from large spatial databases. It should be noted that the full general problem of extracting a maximal clique from a graph is known as NP-Hard. The experimental results show that the GridClique algorithm successfully generates all maximal cliques in the SDSS data and enables the generation of useful co-location patterns.
Keywords :
data mining; grid computing; very large databases; GridClique algorithm; data mining techniques; large spatial databases; maximal clique enumeration; sloan digital sky survey data; spatial colocation patterns mining; Astronomy; Australia; Brightness; Catalogs; Data mining; Focusing; Information technology; Mesh generation; Spatial databases; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications, 2008. AICCSA 2008. IEEE/ACS International Conference on
Conference_Location :
Doha
Print_ISBN :
978-1-4244-1967-8
Electronic_ISBN :
978-1-4244-1968-5
Type :
conf
DOI :
10.1109/AICCSA.2008.4493526
Filename :
4493526
Link To Document :
بازگشت