DocumentCode
2724002
Title
The Parallel Classification of Very Large Collections of Data on Multi-core Platforms
Author
Andrada, Dodut Aurora ; Lemnaru, Camelia ; Potolea, Rodica
Author_Institution
Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear
2011
fDate
6-8 July 2011
Firstpage
57
Lastpage
62
Abstract
Perhaps the most utilized and demanded task in data mining is classification. Most existing classification algorithms require all the data used for constructing the model for classification, or at least a good part of it, to be stored in the memory. This makes them limited by the availability of the memory. We present a parallel algorithm, based on the SPRINT decision tree, which eliminates the dependency on the memory available by storing the data to be processed in a database.
Keywords
data mining; decision trees; multiprocessing systems; parallel algorithms; very large databases; SPRINT decision tree; data mining; multicore platforms; parallel algorithm; parallel classification; very large collections; Buildings; Data mining; Decision trees; Indexes; Instruction sets; Training; classification; large datasets; multi-thread; parallel computation;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Computing (ISPDC), 2011 10th International Symposium on
Conference_Location
Cluj Napoca
Print_ISBN
978-1-4577-1536-5
Type
conf
DOI
10.1109/ISPDC.2011.18
Filename
6108256
Link To Document