DocumentCode :
174536
Title :
Improved document classification through enhanced Naive Bayes algorithm
Author :
Sathyadevan, Shiju ; Athira, U. ; Sarath, P.R. ; Anjana, V.
Author_Institution :
Amrita Cybersecurity Centre, Amrita Vishwa Vidyapeetham, Kollam, India
fYear :
2014
fDate :
26-28 Aug. 2014
Firstpage :
100
Lastpage :
104
Abstract :
Immense growth in communication has paved way for existence of information across the world in wide separated zones. There exists a need for a mechanism to render apt information to the needy from this enormous source of information. This mechanism is of high demand for educational purposes. Knowledge based cloud (Kloud) proposes a solution to combine together the information in different area, which is managed by several organizations. It then organizes them into different sections and hence providing a platform to furnish relevant information to people in search of it. The paper discusses about a method based on Naive Bayes algorithm to classify documents pushed into "Kloud". A variation to this algorithm has been implemented by calculating term weight using "converged weight" method resulting in better accuracy and speed. A comparative study of proposed variance in classification algorithm against the actual algorithm was performed. Further we also implemented two subclassification algorithms namely hierarchical subclassification and subcategorization using document similarity method.
Keywords :
Bayes methods; cloud computing; document handling; knowledge based systems; pattern classification; Kloud; converged weight method; document classification improvement; document similarity method; educational purposes; enhanced naive Bayes algorithm; hierarchical subclassification algorithm; information organization; information rendering; information search; information source; knowledge based cloud; subcategorization algorithm; term weight; Accuracy; Classification algorithms; Computer security; Equations; Support vector machine classification; Text categorization; Training; Converged weight; Document classification; Naïve bayes; Term weight; Word vector;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Science & Engineering (ICDSE), 2014 International Conference on
Conference_Location :
Kochi
Print_ISBN :
978-1-4799-6870-1
Type :
conf
DOI :
10.1109/ICDSE.2014.6974619
Filename :
6974619
Link To Document :
بازگشت