• DocumentCode
    2536108
  • Title
    Categorization of News Articles: A Model Based on Discriminative Term Extraction Method
  • Author
    Sanwaliya, Abhishek ; Shanker, Kripa ; Misra, Subhas C.
  • Author_Institution
    Dept. of Ind. & Manage. Eng., Indian Inst. of Technol. Kanpur, Kanpur, India
  • fYear
    2010
  • fDate
    11-16 April 2010
  • Firstpage
    149
  • Lastpage
    154
  • Abstract
    Categorization techniques contribute substantially to building automated systems that support decision-making tasks for better organization and management of resources. The objective of this research is to assess the relative performance of some well-known classification methods. Among the proposed approaches, our discriminative term extraction (DTE) based combined naive Bayes and K-NN (NB-KNN) approach offers short learning time, owing to its computational efficiency, together with comparatively high accuracy. We designed the DTE-based NB-KNN model for multi-class, single-label text categorization. Our experiments suggest that data characteristics have a considerable impact on the performance of classification methods. The results obtained on the Reuters-21578 corpus show that NB-KNN consistently outperforms the single naive Bayes and K-NN classifiers on precision, recall and F1 scores. The results of the study suggest designing a classification system in which several classification methods are combined to increase the reliability, consistency and accuracy of categorization.
  • Keywords
    Bayes methods; decision making; knowledge acquisition; neural nets; text analysis; NB-KNN approach; decision making tasks; discriminative term extraction method; news article categorization; single label text categorization; Conference management; Data mining; Decision making; Engineering management; Feature extraction; Knowledge management; Resource management; Risk management; Technology management; Text categorization; K-NN classifier; combined naïve bayes and K-NN classifier; discriminative term extraction; naïve bayes classifier; performance
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advances in Databases Knowledge and Data Applications (DBKDA), 2010 Second International Conference on
  • Conference_Location
    Menuires
  • Print_ISBN
    978-1-4244-6081-6
  • Type
    conf
  • DOI
    10.1109/DBKDA.2010.18
  • Filename
    5477131