DocumentCode :
145750
Title :
Distributed boosting algorithm for classification of text documents
Author :
Sarnovsky, Martin ; Vronc, Michal
Author_Institution :
Dept. of Cybern. & Artificial Intell., Tech. Univ. in Kosice, Kosice, Slovakia
fYear :
2014
fDate :
23-25 Jan. 2014
Firstpage :
217
Lastpage :
220
Abstract :
Presented paper focuses on the area of analysis and classification of textual documents. We present the classification of documents based on boosting method applied on the decision tree algorithm. Main objective of the paper is to present the implementation of distributed boosting algorithm based on Map Reduce paradigm. We have used the GridGain framework as a platform for distributed data processing and have tested the implemented solution on two different dataset within our testing environment.
Keywords :
data mining; decision trees; learning (artificial intelligence); pattern classification; text analysis; GridGain framework; Map Reduce; boosting method; decision tree algorithm; distributed boosting algorithm; distributed data processing; text document classification; textual document analysis; Algorithm design and analysis; Boosting; Classification algorithms; Computational modeling; Informatics; Text mining; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applied Machine Intelligence and Informatics (SAMI), 2014 IEEE 12th International Symposium on
Conference_Location :
Herl´any
Print_ISBN :
978-1-4799-3441-6
Type :
conf
DOI :
10.1109/SAMI.2014.6822410
Filename :
6822410
Link To Document :
بازگشت