DocumentCode :
2724648
Title :
Data Mining based Query Processing using Rough Sets and Genetic Algorithms
Author :
Srinivasa, K.G. ; Jagadish, M. ; Venugopal, K.R. ; Patnaik, L.M.
Author_Institution :
Dept. of Comput. Sci., MS Ramaiah Inst. of Technol., Bangalore
fYear :
2007
fDate :
March 1 2007-April 5 2007
Firstpage :
275
Lastpage :
282
Abstract :
The optimization of queries is critical in database management systems and the complexity involved in finding optimal solutions has led to the development of heuristic approaches. Answering data mining query involves a random search over large databases. Due to the enormity of the data set involved, model simplification is necessary for quick answering of data mining queries. In this paper, we propose a hybrid model using rough sets and genetic algorithms for fast and efficient query answering. Rough sets are used to classify and summarize the datasets, whereas genetic algorithms are used for answering association related queries and feedback for adaptive classification. Here, we consider three types of queries, i.e., select, aggregate and classification based data mining queries. Summary tables that are built using rough sets and analytical model of attributes are used to speed up select queries. Mining associations, building concept hierarchies and reinforcement of reducts are achieved through genetic algorithms. The experiments are conducted on three real-life data sets, which include KDD 99 Cup data, Forest Cover-type data and Iris data. The performance of the proposed algorithm is analyzed for both execution time and classification accuracy and the results obtained are good
Keywords :
data mining; database management systems; genetic algorithms; query processing; rough set theory; adaptive classification; aggregate based data mining query processing; association mining; classification based data mining query processing; concept hierarchies; database management systems; genetic algorithms; large databases; query answering; query optimization; random search; reduct reinforcement; rough sets; select based data mining query processing; Aggregates; Analytical models; Data mining; Database systems; Feedback; Genetic algorithms; Iris; Performance analysis; Query processing; Rough sets; Genetic Algorithms; Optimization; Query Answering; Rough Sets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
Type :
conf
DOI :
10.1109/CIDM.2007.368884
Filename :
4221308
Link To Document :
بازگشت