• DocumentCode
    494438
  • Title

    Multi-class Text Categorization Based on Immune Algorithm

  • Author

    Zhang, Qirui ; Luo, Man ; Xue, Yonggang ; Tan, Jinghua

  • Author_Institution
    Coll. of Med. Inf. Eng., Guangdong Pharm. Univ., Guangzhou
  • Volume
    1
  • fYear
    2008
  • fDate
    21-22 Dec. 2008
  • Firstpage
    749
  • Lastpage
    752
  • Abstract
    The nature of immune algorithm is to distinguish between self and non-self. On the basis of immune algorithm, we propose a new method of text categorization called the Clonal Selection Algorithm Based on Antibody Density (CSABAD). According to the clonal selection principle and density control mechanism, only those cells that have higher affinity and lower density are selected to proliferate. The ultimate classifier is composed with many memory cells. Because CSABAD is a two-class algorithm, we discuss the main approaches how to apply it to the multi-class problem, such as one-vs-one (OVO), one-vs-rest (OVR) and directed acyclic graph (DAG). Our experiments show that one-vs-one obtains the best classification performance on the 20_newsgroups dataset. In addition, the experiment results also show that it significantly outperforms Rocchio and Naive Bayes.
  • Keywords
    artificial immune systems; pattern classification; text analysis; antibody density; clonal selection algorithm; density control mechanism; directed acyclic graph; immune algorithm; memory cells; multiclass text categorization; Animals; Automatic control; Classification algorithms; Immune system; Machine learning; Machine learning algorithms; Pattern recognition; Support vector machine classification; Support vector machines; Text categorization; antibody density; clonal selection; immune algorithm; multi class; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Education Technology and Training, 2008. and 2008 International Workshop on Geoscience and Remote Sensing. ETT and GRS 2008. International Workshop on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3563-0
  • Type

    conf

  • DOI
    10.1109/ETTandGRS.2008.337
  • Filename
    5070261