DocumentCode
1583489
Title
Ensemble learning approach in improved K Nearest Neighbor algorithm for Text categorization
Author
Iswarya, P. ; Radha, V.
Author_Institution
Dept. of Comput. Sci., Avinashilingam Inst. for Home Sci. & Higher Educ. for Women, Coimbatore, India
fYear
2015
Firstpage
1
Lastpage
5
Abstract
Due to the tremendous growth of digital content in World Wide Web (WWW), Text categorization has become an important tool to manage and organize text related data. This paper proposes an Ensemble Learning approach in Improved K Nearest Neighbor algorithm for Text Categorization (EINNTC), which consists of single pass clustering, Ensemble learning and KNN algorithm. The EINNTC method provides solution to traditional KNN classifier issues, by reducing the huge text similarity computation complexity, avoids an impact of noisy training sample, and expediting the process of finding K nearest neighbors. The experiments were carried out with standard benchmark Reuters dataset, and their empirical results shows that the proposed method outperforms the SVM and KNN classifiers.
Keywords
Internet; computational complexity; learning (artificial intelligence); pattern classification; text analysis; EINNTC; EINNTC method; KNN classifier algorithm; SVM classifiers; WWW; World Wide Web; ensemble learning approach in improved k nearest neighbor algorithm for text categorization; noisy training sample; standard benchmark Reuters dataset; text related data management; text related data organization; text similarity computation complexity; Classification algorithms; Clustering algorithms; Conferences; Support vector machines; Technological innovation; Text categorization; Training; Categorization; Clustering; Ensemble; K nearest neighbor; Support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovations in Information, Embedded and Communication Systems (ICIIECS), 2015 International Conference on
Conference_Location
Coimbatore
Print_ISBN
978-1-4799-6817-6
Type
conf
DOI
10.1109/ICIIECS.2015.7193250
Filename
7193250
Link To Document