DocumentCode
3248449
Title
Using cluster computing to support automatic and dynamic database clustering
Author
Guinepain, Sylvain ; Gruenwald, Le
Author_Institution
Sch. of Comput. Sci., Univ. of Oklahoma Norman, Norman, OK
fYear
2008
fDate
Sept. 29 2008-Oct. 1 2008
Firstpage
394
Lastpage
401
Abstract
Query response time is the number one metrics when it comes to database performance. Because of data proliferation, efficient access methods and data storage techniques have become increasingly critical to maintain an acceptable query response time. Retrieving data from disk is several orders of magnitude slower than retrieving it from memory, it is easy to see the direct correlation between query response time and the number of disk I/Os. One of the common ways to reduce disk I/Os and therefore improve query response time is database clustering, which is a process that partitions the database vertically (attribute clustering) and/or horizontally (record clustering). A clustering is optimized for a given set of queries. However in dynamic systems the queries change with time, the clustering in place becomes obsolete, and the database needs to be re-clustered dynamically. This paper presents an efficient algorithm for attribute clustering that dynamically and automatically generates attribute clusters based on closed item sets mined from the attributes sets found in the queries running against the database. The paper then discusses how this algorithm can be implemented using the cluster computing paradigm to reduce query response time even further through parallelism and data redundancy.
Keywords
data handling; pattern clustering; query processing; attribute clustering; cluster computing; data parallelism; data proliferation; data redundancy; data retrieval; data storage; dynamic database clustering; query response; record clustering; Clustering algorithms; Computer science; Concurrent computing; Delay; Humans; Information retrieval; Memory; Operating systems; Parallel processing; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing, 2008 IEEE International Conference on
Conference_Location
Tsukuba
ISSN
1552-5244
Print_ISBN
978-1-4244-2639-3
Electronic_ISBN
1552-5244
Type
conf
DOI
10.1109/CLUSTR.2008.4663800
Filename
4663800
Link To Document