Title :
Data mining based fragmentation technique for distributed data warehouses environment Using predicate construction technique
Author :
Karima, Tekaya ; Abdellatif, Abdelaziz ; Ounalli, Habib
Author_Institution :
Data-Process. Dept., Fac. of Sci. of Tunis, Tunis, Tunisia
Abstract :
Distributed Data Warehouses (DDWs) afford several advantages over traditional environments. Such architecture improves system performance by allowing data to be spread across data marts. Subsequently, queries can be run over smaller data sets and therefore their execution time reduces. To design an effective distributed model, it is important to manage an appropriate methodology for data fragmentation and fragment allocation. Nevertheless, very little works address this problem in a distributed context. This paper is focuses on DDW. It proposes a data mining-based horizontal fragmentation methodology for a relational DDW environment. This methodology combines the known predicate construction technique with a clustering method to fragment Data Warehouse (DW) relations. Fragments are then allocated to the corresponding site according to their frequency of use. We show experimentally with the use of the APB-1 release II benchmark that DW decentralization gives better performance. Global queries execution time is fewer by 80%.
Keywords :
data mining; data warehouses; clustering method; data fragmentation; data mining; distributed data warehouse; fragmentation technique; Allocation; Distributed data warehouse; Fragmentation; K-means;
Conference_Titel :
Networked Computing and Advanced Information Management (NCM), 2010 Sixth International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-7671-8
Electronic_ISBN :
978-89-88678-26-8