• DocumentCode
    182984
  • Title

    A novel feature voting model for text classification

  • Author

    Sen Jia ; Jinquan Liang ; Yao Xie ; Lin Deng

  • Author_Institution
    Key Lab. of Spatial Inf. Intell. Perception & Services, Shenzhen Univ., Shenzhen, China
  • fYear
    2014
  • fDate
    19-21 Aug. 2014
  • Firstpage
    306
  • Lastpage
    311
  • Abstract
    Along with the information explosion in the Internet era, the traditional classification methods, such as KNN (k-nearest neighbor), Naive Bayes (NB), encounter bottlenecks due to the endless stream of new words. In this paper, through comparing with the Rocchio and Bayesian algorithms, it has been found that centroid-based algorithms are insufficient for text classification. Therefore, a novel feature voting model is proposed, which gives rise to a bag-of-words based feature voting algorithm for text classification. This algorithm assigns categories for each document according to the ranking of weighted sum of feature values. Experimental results have shown the efficiency of the proposed method over the other state-of-the-art methods.
  • Keywords
    Bayes methods; pattern classification; text analysis; Bayesian algorithm; Internet; KNN; NB; Rocchio algorithm; bag-of-words based feature voting algorithm; centroid-based algorithms; information explosion; k-nearest neighbor; naive Bayes; text classification; Accuracy; Classification algorithms; Equations; Internet; Mathematical model; Training; Vectors; Naive Bayes; feature voting; text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4799-5147-5
  • Type

    conf

  • DOI
    10.1109/FSKD.2014.6980851
  • Filename
    6980851