• DocumentCode
    2724002
  • Title

    The Parallel Classification of Very Large Collections of Data on Multi-core Platforms

  • Author

    Andrada, Dodut Aurora ; Lemnaru, Camelia ; Potolea, Rodica

  • Author_Institution
    Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
  • fYear
    2011
  • fDate
    6-8 July 2011
  • Firstpage
    57
  • Lastpage
    62
  • Abstract
    Perhaps the most utilized and demanded task in data mining is classification. Most existing classification algorithms require all the data used for constructing the model for classification, or at least a good part of it, to be stored in the memory. This makes them limited by the availability of the memory. We present a parallel algorithm, based on the SPRINT decision tree, which eliminates the dependency on the memory available by storing the data to be processed in a database.
  • Keywords
    data mining; decision trees; multiprocessing systems; parallel algorithms; very large databases; SPRINT decision tree; data mining; multicore platforms; parallel algorithm; parallel classification; very large collections; Buildings; Data mining; Decision trees; Indexes; Instruction sets; Training; classification; large datasets; multi-thread; parallel computation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Computing (ISPDC), 2011 10th International Symposium on
  • Conference_Location
    Cluj Napoca
  • Print_ISBN
    978-1-4577-1536-5
  • Type

    conf

  • DOI
    10.1109/ISPDC.2011.18
  • Filename
    6108256