DocumentCode
145750
Title
Distributed boosting algorithm for classification of text documents
Author
Sarnovsky, Martin ; Vronc, Michal
Author_Institution
Dept. of Cybern. & Artificial Intell., Tech. Univ. in Kosice, Kosice, Slovakia
fYear
2014
fDate
23-25 Jan. 2014
Firstpage
217
Lastpage
220
Abstract
Presented paper focuses on the area of analysis and classification of textual documents. We present the classification of documents based on boosting method applied on the decision tree algorithm. Main objective of the paper is to present the implementation of distributed boosting algorithm based on Map Reduce paradigm. We have used the GridGain framework as a platform for distributed data processing and have tested the implemented solution on two different dataset within our testing environment.
Keywords
data mining; decision trees; learning (artificial intelligence); pattern classification; text analysis; GridGain framework; Map Reduce; boosting method; decision tree algorithm; distributed boosting algorithm; distributed data processing; text document classification; textual document analysis; Algorithm design and analysis; Boosting; Classification algorithms; Computational modeling; Informatics; Text mining; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Applied Machine Intelligence and Informatics (SAMI), 2014 IEEE 12th International Symposium on
Conference_Location
Herl´any
Print_ISBN
978-1-4799-3441-6
Type
conf
DOI
10.1109/SAMI.2014.6822410
Filename
6822410
Link To Document