• DocumentCode
    2507758
  • Title

    An Adaptive Fuzzy kNN Text Classifier Based on Gini Index Weight

  • Author

    Shang, Wenqian ; Qu, Youli ; Zhu, Haibin ; Huang, Houkuan ; Lin, Yongmin ; Dong, Hongbin

  • Author_Institution
    Beijing Jiaotong University, China
  • fYear
    2006
  • fDate
    26-29 June 2006
  • Firstpage
    448
  • Lastpage
    453
  • Abstract
    In recent years, kNN algorithm is paid attention by many researchers and is proved one of the best text categorization algorithms. Text categorization is according to training set, which is assigned class label to decide a new document, which is not assigned class label belongs to some kind of document. But for a classifier, text preprocessing is the bottleneck of categorization. In the original feature space, there are always thousands upon thousands words. The dimension of feature space is very high. So in this paper, we adopt a new feature weight method---- improved Gini index to reduce the dimension of feature space and improve the categorization precision. In addition, we discuss the improvement of decision rule and dimension selection. We design an adaptive fuzzy kNN text classifier. Here the adaptive indicate the adaptive of dimension selection. The experiment results show that our algorithm is effective and feasible.
  • Keywords
    Algorithm design and analysis; Computer science; Decision trees; Information technology; Internet; Least squares methods; Machine learning; Machine learning algorithms; Support vector machines; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers and Communications, 2006. ISCC '06. Proceedings. 11th IEEE Symposium on
  • ISSN
    1530-1346
  • Print_ISBN
    0-7695-2588-1
  • Type

    conf

  • DOI
    10.1109/ISCC.2006.27
  • Filename
    1691068