Title :
Machine learning on Big Data
Author :
Condie, T. ; Mineiro, P. ; Polyzotis, N. ; Weimer, M.
Author_Institution :
Cloud & Inf. Services Lab., Microsoft, Redmond, WA, USA
Abstract :
Statistical Machine Learning has undergone a phase transition from a pure academic endeavor to being one of the main drivers of modern commerce and science. Even more so, recent results such as those on tera-scale learning [1] and on very large neural networks [2] suggest that scale is an important ingredient in quality modeling. This tutorial introduces current applications, techniques and systems with the aim of cross-fertilizing research between the database and machine learning communities. The tutorial covers current large scale applications of Machine Learning, their computational model and the workflow behind building those. Based on this foundation, we present the current state-of-the-art in systems support in the bulk of the tutorial. We also identify critical gaps in the state-of-the-art. This leads to the closing of the seminar, where we introduce two sets of open research questions: Better systems support for the already established use cases of Machine Learning and support for recent advances in Machine Learning research.
Keywords :
data analysis; learning (artificial intelligence); big data; cross fertilizing research; database; neural networks; phase transition; pure academic endeavor; quality modeling; statistical machine learning; tera scale learning; tutorial; Big data; Communities; Computational modeling; Databases; Machine learning algorithms; Seminars; Tutorials;
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
DOI :
10.1109/ICDE.2013.6544913