DocumentCode :
2404901
Title :
Efficient evaluation of queries with mining predicates
Author :
Chaudhuri, Surajit ; Narasayya, Vivek ; Sarawagi, Sunita
Author_Institution :
Microsoft Corp., Redmond, WA, USA
fYear :
2002
fDate :
2002
Firstpage :
529
Lastpage :
540
Abstract :
Modern relational database systems are beginning to support ad-hoc queries on data mining models. In this paper, we explore novel techniques for optimizing queries that apply mining models to relational data. For such queries, we use the internal structure of the mining model to automatically derive traditional database predicates. We present algorithms for deriving such predicates for some popular discrete mining models: decision trees, naive Bayes, and clustering. Our experiments on a Microsoft SQL Server 2000 demonstrate that these derived predicates can significantly reduce the cost of evaluating such queries
Keywords :
Bayes methods; SQL; data mining; decision trees; file servers; pattern clustering; query processing; relational databases; Microsoft SQL Server 2000; ad-hoc queries; clustering; data mining; database predicates; decision trees; discrete mining models; model internal structure; naive Bayes model; query evaluation cost; query optimization; relational database systems; Business; Chromium; Clustering algorithms; Costs; Data mining; Engines; Filtering; Postal services; Predictive models; Relational databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2002. Proceedings. 18th International Conference on
Conference_Location :
San Jose, CA
ISSN :
1063-6382
Print_ISBN :
0-7695-1531-2
Type :
conf
DOI :
10.1109/ICDE.2002.994772
Filename :
994772
Link To Document :
بازگشت