• DocumentCode
    3248449
  • Title

    Using cluster computing to support automatic and dynamic database clustering

  • Author

    Guinepain, Sylvain ; Gruenwald, Le

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Oklahoma Norman, Norman, OK
  • fYear
    2008
  • fDate
    Sept. 29 2008-Oct. 1 2008
  • Firstpage
    394
  • Lastpage
    401
  • Abstract
    Query response time is the number one metrics when it comes to database performance. Because of data proliferation, efficient access methods and data storage techniques have become increasingly critical to maintain an acceptable query response time. Retrieving data from disk is several orders of magnitude slower than retrieving it from memory, it is easy to see the direct correlation between query response time and the number of disk I/Os. One of the common ways to reduce disk I/Os and therefore improve query response time is database clustering, which is a process that partitions the database vertically (attribute clustering) and/or horizontally (record clustering). A clustering is optimized for a given set of queries. However in dynamic systems the queries change with time, the clustering in place becomes obsolete, and the database needs to be re-clustered dynamically. This paper presents an efficient algorithm for attribute clustering that dynamically and automatically generates attribute clusters based on closed item sets mined from the attributes sets found in the queries running against the database. The paper then discusses how this algorithm can be implemented using the cluster computing paradigm to reduce query response time even further through parallelism and data redundancy.
  • Keywords
    data handling; pattern clustering; query processing; attribute clustering; cluster computing; data parallelism; data proliferation; data redundancy; data retrieval; data storage; dynamic database clustering; query response; record clustering; Clustering algorithms; Computer science; Concurrent computing; Delay; Humans; Information retrieval; Memory; Operating systems; Parallel processing; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing, 2008 IEEE International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1552-5244
  • Print_ISBN
    978-1-4244-2639-3
  • Electronic_ISBN
    1552-5244
  • Type

    conf

  • DOI
    10.1109/CLUSTR.2008.4663800
  • Filename
    4663800