• DocumentCode
    589155
  • Title

    Parallel Concept Drift Detection with Online Map-Reduce

  • Author

    Andrzejak, Artur ; Gomes, Joao Bartolo

  • Author_Institution
    Heidelberg Univ., Heidelberg, Germany
  • fYear
    2012
  • fDate
    10-10 Dec. 2012
  • Firstpage
    402
  • Lastpage
    407
  • Abstract
    Empirical evidence shows that massive data sets have rarely (if ever) a stationary underlying distribution. To obtain meaningful classification models, partitioning data into different concepts is required as an inherent part of learning. However, existing state-of-the-art approaches to concept drift detection work only sequentially (i.e. in a non-parallel fashion) which is a serious scalability limitation. To address this issue, we extend one of the sequential approaches to work in parallel and propose an Online Map-Reduce Drift Detection Method (OMR-DDM). It uses the combined online error rate of the parallel classification algorithms to identify changes in the underlying concept. For reasons of algorithmic efficiency it is built on a modified version of the popular Map-Reduce paradigm which permits for using preliminary results within mappers. An experimental evaluation shows that the proposed method can accurately detect concept drift while exploiting parallel processing. This paves the way to obtaining classification models which consider concept drift on massive data.
  • Keywords
    parallel algorithms; pattern classification; OMR-DDM; algorithmic efficiency; classification models; concept change identification; data partitioning; online error rate; online map-reduce drift detection method; parallel classification algorithms; parallel concept drift detection; parallel processing; sequential approaches; Accuracy; Adaptation models; Computational modeling; Data models; Noise; Silicon; Synchronization; Concept-Drif; Map-Reduce; Parallel Data Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • Print_ISBN
    978-1-4673-5164-5
  • Type

    conf

  • DOI
    10.1109/ICDMW.2012.102
  • Filename
    6406468