• DocumentCode
    20385
  • Title

    Bring Your Own Learner: A Cloud-Based, Data-Parallel Commons for Machine Learning

  • Author

    Arnaldo, Ignacio ; Veeramachaneni, Kalyan ; Song, Andrew ; O´Reilly, Una-May

  • Author_Institution
    Comput. Sci. & Artificial Intell. Lab., MIT, Cambridge, MA, USA
  • Volume
    10
  • Issue
    1
  • fYear
    2015
  • fDate
    Feb. 2015
  • Firstpage
    20
  • Lastpage
    32
  • Abstract
    We introduce FCUBE, a cloud-based framework that enables machine learning researchers to contribute their learners to its community-shared repository. FCUBE exploits data parallelism in lieu of algorithmic parallelization to allow its users to efficiently tackle large data problems automatically. It passes random subsets of data generated via resampling to multiple learners that it executes simultaneously and then it combines their model predictions with a simple fusion technique. It is an example of what we have named a Bring Your Own Learner model. It allows multiple machine learning researchers to contribute algorithms in a plug-and-play style. We contend that the Bring Your Own Learner model signals a design shift in cloud-based machine learning infrastructure because it is capable of executing anyone´s supervised machine learning algorithm. We demonstrate FCUBE executing five different learners contributed by three different machine learning groups on a 100 node deployment on Amazon EC2. They collectively solve a publicly available classification problem trained with 11 million exemplars from the Higgs dataset.
  • Keywords
    cloud computing; data handling; learning (artificial intelligence); parallel processing; FCUBE; Higgs dataset; bring your own learner model; community shared repository; data parallel commons; data problems; fusion technique; supervised machine learning algorithm; Algorithm design and analysis; Classification; Cloud computing; Data models; Machine learning algorithms; Parallel processing; Predictive models;
  • fLanguage
    English
  • Journal_Title
    Computational Intelligence Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1556-603X
  • Type

    jour

  • DOI
    10.1109/MCI.2014.2369892
  • Filename
    7010434