• DocumentCode
    2710201
  • Title

    An Effective Algorithm for Improving the Performance of Naive Bayes for Text Classification

  • Author

    GuoQiang

  • Author_Institution
    Higher Vocational Coll., Shanghai Univ. Of Eng. Sci., Shanghai, China
  • fYear
    2010
  • fDate
    7-10 May 2010
  • Firstpage
    699
  • Lastpage
    701
  • Abstract
    Naive Bayes algorithm is uncomplicated and effective in text classification and experiments. However, its performance is often imperfect because it does not model text well, and by inappropriate feature selection and some disadvantages of the Naive Bayes itself. This paper makes some modifications for Naive Bayes to improve the performance of Naive Bayes and the effect, condition as well, on categorization. Finally, the paper adopts this algorithm in Spam Filter categorization, a quite typical text classification. Some experiments were done with this method; results were compared with its previous method.
  • Keywords
    Bayes methods; learning (artificial intelligence); pattern classification; text analysis; feature selection; naive Bayes algorithm; spam filter categorization; text classification; Application software; Educational institutions; Electronic mail; Frequency; High performance computing; Information filtering; Information filters; Machine learning; Research and development; Text categorization; component; machine learning; naive Bayes classification; text classification; words frequencies;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Research and Development, 2010 Second International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-0-7695-4043-6
  • Type

    conf

  • DOI
    10.1109/ICCRD.2010.160
  • Filename
    5489530