DocumentCode
20385
Title
Bring Your Own Learner: A Cloud-Based, Data-Parallel Commons for Machine Learning
Author
Arnaldo, Ignacio ; Veeramachaneni, Kalyan ; Song, Andrew ; O'Reilly, Una-May
Author_Institution
Comput. Sci. & Artificial Intell. Lab., MIT, Cambridge, MA, USA
Volume
10
Issue
1
fYear
2015
fDate
Feb. 2015
Firstpage
20
Lastpage
32
Abstract
We introduce FCUBE, a cloud-based framework that enables machine learning researchers to contribute their learners to its community-shared repository. FCUBE exploits data parallelism in lieu of algorithmic parallelization to allow its users to efficiently tackle large data problems automatically. It passes random subsets of data generated via resampling to multiple learners that it executes simultaneously, and then it combines their model predictions with a simple fusion technique. It is an example of what we have named a Bring Your Own Learner model. It allows multiple machine learning researchers to contribute algorithms in a plug-and-play style. We contend that the Bring Your Own Learner model signals a design shift in cloud-based machine learning infrastructure because it is capable of executing anyone's supervised machine learning algorithm. We demonstrate FCUBE executing five different learners contributed by three different machine learning groups on a 100-node deployment on Amazon EC2. They collectively solve a publicly available classification problem trained with 11 million exemplars from the Higgs dataset.
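The data-parallel pattern the abstract describes (resample random subsets, train several learners at once, fuse their predictions) can be illustrated with a minimal sketch. This is not FCUBE's code or API: the function names, the 1-D nearest-centroid stand-in learner, the thread pool in place of cloud nodes, and majority voting as the "simple fusion technique" are all assumptions made for illustration.

```python
import random
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def train_centroid_learner(sample):
    """Hypothetical stand-in learner: a 1-D nearest-centroid classifier."""
    sums, counts = {}, {}
    for x, label in sample:
        sums[label] = sums.get(label, 0.0) + x
        counts[label] = counts.get(label, 0) + 1
    centroids = {label: sums[label] / counts[label] for label in sums}
    # The returned model maps a feature value to the label of the closest centroid.
    return lambda x: min(centroids, key=lambda label: abs(x - centroids[label]))


def train_data_parallel(data, n_learners=5, subset_size=50, seed=0):
    """Train one learner per random subset; threads stand in for cloud nodes."""
    rng = random.Random(seed)
    # Resampling with replacement gives each learner its own random subset.
    subsets = [rng.choices(data, k=subset_size) for _ in range(n_learners)]
    with ThreadPoolExecutor(max_workers=n_learners) as pool:
        return list(pool.map(train_centroid_learner, subsets))


def fuse_predictions(models, x):
    """Assumed fusion rule: majority vote over the individual model predictions."""
    votes = Counter(model(x) for model in models)
    return votes.most_common(1)[0][0]


# Toy two-class data: class 0 lies in [0, 1), class 1 lies in [2, 3).
data = [(i / 100, 0) for i in range(100)] + [(2 + i / 100, 1) for i in range(100)]
models = train_data_parallel(data)
print(fuse_predictions(models, 0.2), fuse_predictions(models, 2.7))
```

Because each learner only ever sees its own subset, any supervised learner with the same train/predict shape could be substituted for the stand-in without changing the surrounding code, which is the plug-and-play property the abstract emphasizes.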
Keywords
cloud computing; data handling; learning (artificial intelligence); parallel processing; FCUBE; Higgs dataset; bring your own learner model; community shared repository; data parallel commons; data problems; fusion technique; supervised machine learning algorithm; Algorithm design and analysis; Classification; Cloud computing; Data models; Machine learning algorithms; Parallel processing; Predictive models;
fLanguage
English
Journal_Title
Computational Intelligence Magazine, IEEE
Publisher
IEEE
ISSN
1556-603X
Type
jour
DOI
10.1109/MCI.2014.2369892
Filename
7010434