DocumentCode
3399163
Title
Building clusters with distributed features for text classification using KNN
Author
Wajeed, Mohammed Abdul ; Adilakshmi, T.
Author_Institution
SCSI, Sreenidhi Inst. of Sci. & Technol., Hyderabad, India
fYear
2012
fDate
10-12 Jan. 2012
Firstpage
1
Lastpage
6
Abstract
Bulk data is generated in the era of Information Technology. If it is not stored in a properly systematic manner then the generated data cannot be reused. This is because navigation becomes if not impossible, certainly very difficult. So we classify the data before it is stored. Present paper explores the techniques to store the data in a supervised classification paradigm using distributed features. Initially Soft, hard and mixed Clusters are build based on the distributed features later the clusters are used to classify the documents based on the K-nearest neighbour classification algorithm.
Keywords
learning (artificial intelligence); pattern classification; pattern clustering; text analysis; K-nearest neighbour classification algorithm; KNN; bulk data; distributed features; information technology; supervised classification paradigm; text classification; Accuracy; Equations; Informatics; Text categorization; Training; Training data; Vectors; distributed features; knn-classifier; soft-hard clusters; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Communication and Informatics (ICCCI), 2012 International Conference on
Conference_Location
Coimbatore
Print_ISBN
978-1-4577-1580-8
Type
conf
DOI
10.1109/ICCCI.2012.6158839
Filename
6158839
Link To Document