DocumentCode
2536108
Title
Categorization of News Articles: A Model Based on Discriminative Term Extraction Method
Author
Sanwaliya, Abhishek ; Shanker, Kripa ; Misra, Subhas C.
Author_Institution
Dept. of Ind. & Manage. Eng., Indian Inst. of Technol. Kanpur, Kanpur, India
fYear
2010
fDate
11-16 April 2010
Firstpage
149
Lastpage
154
Abstract
Categorization techniques have major contribution in building automated system capable to fulfill the needs of decision making tasks for better organization and management of resources. The objective of this research is to assess the relative performance of some well-known classification methods. Among the proposed approaches our discriminative term extraction (DTE) based combined naive bayes and K-NN (NB-KNN) approach has the advantages of short learning time due to its computational efficiency with comparatively high accuracy. We designed DTE based NBKNN model for multi-class, single label text categorization. Our experiments suggest that data characteristics have considerable impact on the performance of classification methods. The Results obtained from Reuters-21578 corpus shows that NB-KNN consistently outperforms the single naive bayes and K-NN classifiers on Precision, Recall and Fl scores. The results of the study suggest designing a classification system in which several classification methods can be combined to increase the reliability, consistency and accuracy of the categorization.
Keywords
Bayes methods; decision making; knowledge acquisition; neural nets; text analysis; NB-KNN approach; decision making tasks; discriminative term extraction method; news article categorization; single label text categorization; Conference management; Data mining; Decision making; Engineering management; Feature extraction; Knowledge management; Resource management; Risk management; Technology management; Text categorization; K-NN classifier; Text categorization; combined naïve bayes and K-NN classifier; discriminative term extraction; naïve bayes classifier; performance;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Databases Knowledge and Data Applications (DBKDA), 2010 Second International Conference on
Conference_Location
Menuires
Print_ISBN
978-1-4244-6081-6
Type
conf
DOI
10.1109/DBKDA.2010.18
Filename
5477131
Link To Document