Title :
FactorBase : Multi-relational model learning with SQL all the way
Author :
Zhensong Qian;Oliver Schulte
Author_Institution :
Simon Fraser University, Vancouver-Burnaby, Canada
Abstract :
We describe FactorBase, a new SQL-based framework that leverages a relational database management system to support multi-relational model discovery. A multi-relational statistical model provides an integrated analysis of the heterogeneous and interdependent data resources in the database. We adopt the BayesStore design philosophy: statistical models are stored and managed as first-class citizens inside a database [30]. Whereas previous systems like BayesStore support multi-relational inference, FactorBase supports multi-relational learning. A case study on six benchmark databases evaluates how our system supports a challenging machine learning application, namely learning a first-order Bayesian network model for an entire database. Model learning in this setting has to examine a large number of potential statistical associations across data tables. Our implementation shows how the SQL constructs in Factor-Base facilitate the fast, modular, and reliable development of highly scalable model learning systems.
Keywords :
"Databases","Computational modeling","Data models","Bayes methods","Random variables","Graphical models","Benchmark testing"
Conference_Titel :
Data Science and Advanced Analytics (DSAA), 2015. 36678 2015. IEEE International Conference on
Print_ISBN :
978-1-4673-8272-4
DOI :
10.1109/DSAA.2015.7344828