Title :
Multi-class Text Categorization Based on Immune Algorithm
Author :
Zhang, Qirui ; Luo, Man ; Xue, Yonggang ; Tan, Jinghua
Author_Institution :
Coll. of Med. Inf. Eng., Guangdong Pharm. Univ., Guangzhou
Abstract :
The nature of immune algorithm is to distinguish between self and non-self. On the basis of immune algorithm, we propose a new method of text categorization called the Clonal Selection Algorithm Based on Antibody Density (CSABAD). According to the clonal selection principle and density control mechanism, only those cells that have higher affinity and lower density are selected to proliferate. The ultimate classifier is composed with many memory cells. Because CSABAD is a two-class algorithm, we discuss the main approaches how to apply it to the multi-class problem, such as one-vs-one (OVO), one-vs-rest (OVR) and directed acyclic graph (DAG). Our experiments show that one-vs-one obtains the best classification performance on the 20_newsgroups dataset. In addition, the experiment results also show that it significantly outperforms Rocchio and Naive Bayes.
Keywords :
artificial immune systems; pattern classification; text analysis; antibody density; clonal selection algorithm; density control mechanism; directed acyclic graph; immune algorithm; memory cells; multiclass text categorization; Animals; Automatic control; Classification algorithms; Immune system; Machine learning; Machine learning algorithms; Pattern recognition; Support vector machine classification; Support vector machines; Text categorization; antibody density; clonal selection; immune algorithm; multi class; text categorization;
Conference_Titel :
Education Technology and Training, 2008. and 2008 International Workshop on Geoscience and Remote Sensing. ETT and GRS 2008. International Workshop on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3563-0
DOI :
10.1109/ETTandGRS.2008.337