DocumentCode
577463
Title
A study on the selection of bitmap join index using data mining techniques
Author
An, Hyoung Geun ; Koh, Jae Jin
Author_Institution
School of Electrical Engineering, University of Ulsan 680-749 San 29, Mugeo2-Dong, Nam-Gu, Ulsan Korea
fYear
2012
fDate
18-21 Sept. 2012
Firstpage
1
Lastpage
5
Abstract
The problem of indices selection on the large size data warehouses affects the efficiency of the query processing of the data warehouses. The lower query processing costs can be obtained by using indices, but the indices occupy the large storage areas and induce the index maintenance costs which are accompanied by database updates. By using the bitmap join indices, we can optimize the star join queries which join a fact table and many dimension tables, and do the selection on dimension tables in data warehouses. Although the bitmap join index takes the lower storage costs, the task to select the indexing attributes among the huge candidate attributes is difficult. In this paper we reduce the number of candidate attributes for bitmap join index by the data mining techniques. Compared to the existing techniques which reduce the number of candidate attributes by the frequencies of attributes, we consider the frequencies of attributes and the size of dimmension tables and the size of the tuples of the dimension tables and the page size of disk. We reduce the great number of candidate attributes by using the frequent itemsets mining techniques. We make the bitmap join indices which have the least costs and the least storage areas adapted to storage constraints by using the cost functions applied to the bitmap join indices of the candidate attributes. We compare the existing techniques and ours, and analyze them in order to evaluate the efficiencies of ours.
Keywords
cost reduction; data mining; data warehouses; disc storage; indexing; optimisation; query processing; relational databases; storage management; bitmap join index selection; candidate attributes; cost functions; data mining techniques; dimension tables; disk page size; indexing attributes; large size data warehouses; query processing costs; star join query optimization; storage areas; storage costs; Data mining; Data warehouses; Indexes; Itemsets; Marketing and sales; Query processing; Bitmap join index; data mining; frequent itemsets; relational data warehouse;
fLanguage
English
Publisher
ieee
Conference_Titel
Strategic Technology (IFOST), 2012 7th International Forum on
Conference_Location
Tomsk
Print_ISBN
978-1-4673-1772-6
Type
conf
DOI
10.1109/IFOST.2012.6357667
Filename
6357667
Link To Document