Title :
Categorization of News Articles: A Model Based on Discriminative Term Extraction Method
Author :
Sanwaliya, Abhishek ; Shanker, Kripa ; Misra, Subhas C.
Author_Institution :
Dept. of Ind. & Manage. Eng., Indian Inst. of Technol. Kanpur, Kanpur, India
Abstract :
Categorization techniques have major contribution in building automated system capable to fulfill the needs of decision making tasks for better organization and management of resources. The objective of this research is to assess the relative performance of some well-known classification methods. Among the proposed approaches our discriminative term extraction (DTE) based combined naive bayes and K-NN (NB-KNN) approach has the advantages of short learning time due to its computational efficiency with comparatively high accuracy. We designed DTE based NBKNN model for multi-class, single label text categorization. Our experiments suggest that data characteristics have considerable impact on the performance of classification methods. The Results obtained from Reuters-21578 corpus shows that NB-KNN consistently outperforms the single naive bayes and K-NN classifiers on Precision, Recall and Fl scores. The results of the study suggest designing a classification system in which several classification methods can be combined to increase the reliability, consistency and accuracy of the categorization.
Keywords :
Bayes methods; decision making; knowledge acquisition; neural nets; text analysis; NB-KNN approach; decision making tasks; discriminative term extraction method; news article categorization; single label text categorization; Conference management; Data mining; Decision making; Engineering management; Feature extraction; Knowledge management; Resource management; Risk management; Technology management; Text categorization; K-NN classifier; Text categorization; combined naïve bayes and K-NN classifier; discriminative term extraction; naïve bayes classifier; performance;
Conference_Titel :
Advances in Databases Knowledge and Data Applications (DBKDA), 2010 Second International Conference on
Conference_Location :
Menuires
Print_ISBN :
978-1-4244-6081-6
DOI :
10.1109/DBKDA.2010.18