• DocumentCode
    2194786
  • Title

    Parallelized Boosting with Map-Reduce

  • Author

    Palit, Indranil ; Reddy, Chandan K.

  • Author_Institution
    Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
  • fYear
    2010
  • fDate
    13-13 Dec. 2010
  • Firstpage
    1346
  • Lastpage
    1353
  • Abstract
    In this paper, we propose two novel algorithms, ADABOOST.PL (Parallel ADABOOST) and LOGITBOOST.PL (Parallel LOGITBOOST), that facilitate simultaneous participation of multiple computing nodes to construct a boosted classifier. Our algorithms can induce boosted models whose generalization performance is close to the respective baseline classifier. By exploiting their own parallel architecture both the algorithms gain significant speedup. We used the Map-Reduce framework to implement our algorithms and experimented on a variety of synthetic and real-world data sets to demonstrate the performance in terms of classification accuracy, speedup and scaleup.
  • Keywords
    learning (artificial intelligence); parallel algorithms; pattern classification; set theory; AdaBoost.PL; LogitBoost.PL; Map-Reduce framework; baseline classifier; boosted classifier; boosted models; classification accuracy; generalization performance; multiple computing nodes; parallel AdaBoost; parallel LogitBoost; parallel algorithms; parallel architecture; real-world data sets; Boosting; classification; distributed computing; parallel algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-4244-9244-2
  • Electronic_ISBN
    978-0-7695-4257-7
  • Type

    conf

  • DOI
    10.1109/ICDMW.2010.180
  • Filename
    5693449