• Title of article

    Automatically computed document dependent weighting factor facility for Naïve Bayes classification

  • Author/Authors

    Lee، نويسنده , , Lam Hong and Isa، نويسنده , , Dino، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2010
  • Pages
    8
  • From page
    8471
  • To page
    8478
  • Abstract
    The Naïve Bayes classification approach has been widely implemented in real-world applications due to its simplicity and low cost training and classifying algorithm. As a trade-off to its simplicity, the Naïve Bayes technique has thus been reported to be one of the poorest-performing classification methods around. We have explored and investigated the Naïve Bayes classification approach and found that one of the reasons that causes the low classification accuracy is the mis-classification of documents into several “popular” categories due to the improper organization of the training dataset where the distribution of training documents among categories is highly skewed. In this work, we propose a solution to the problem addressed above, which is the addition of the Automatically Computed Document Dependent (ACDD) weighting factor facility to the Naïve Bayes classifier. The ACDD weighting factors are computed for the purpose of enhancing the classification performance by adjusting the probability values based on the density of classified documents in each available category to minimize the mis-classification rate.
  • Keywords
    Text document classification , ACDD weighting factor facility , naïve Bayes
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2010
  • Journal title
    Expert Systems with Applications
  • Record number

    2348572