• DocumentCode
    1932497
  • Title

    A comparison of several ensemble methods for text categorization

  • Author

    Dong, Yan-Shi ; Han, Ke-Song

  • Author_Institution
    Shanghai Jiao Tong Univ., China
  • fYear
    2004
  • fDate
    15-18 Sept. 2004
  • Firstpage
    419
  • Lastpage
    422
  • Abstract
    Text categorization (TC), as an important domain of machine learning, has many unique traits, such as huge number of features, serious redundant features, dataset imbalance, etc. In this paper the various ensemble methods of naive Bayes classifiers and SVM classifiers are experimentally compared on the TC tasks. Besides, a new type of classifiers, moderated asymmetric naive Bayes classifiers, is proposed. Its advantages over the conventional naive Bayes classifiers in performance and computational efficiency are demonstrated.
  • Keywords
    belief networks; learning (artificial intelligence); pattern classification; support vector machines; text analysis; Bayes classifier; machine learning; support vector machine classifier; text categorization; Bagging; Boosting; Computational efficiency; Machine learning; Neural networks; Niobium; Stacking; Support vector machine classification; Support vector machines; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Services Computing, 2004. (SCC 2004). Proceedings. 2004 IEEE International Conference on
  • Print_ISBN
    0-7695-2225-4
  • Type

    conf

  • DOI
    10.1109/SCC.2004.1358033
  • Filename
    1358033