DocumentCode
494438
Title
Multi-class Text Categorization Based on Immune Algorithm
Author
Zhang, Qirui ; Luo, Man ; Xue, Yonggang ; Tan, Jinghua
Author_Institution
Coll. of Med. Inf. Eng., Guangdong Pharm. Univ., Guangzhou
Volume
1
fYear
2008
fDate
21-22 Dec. 2008
Firstpage
749
Lastpage
752
Abstract
The nature of immune algorithm is to distinguish between self and non-self. On the basis of immune algorithm, we propose a new method of text categorization called the Clonal Selection Algorithm Based on Antibody Density (CSABAD). According to the clonal selection principle and density control mechanism, only those cells that have higher affinity and lower density are selected to proliferate. The ultimate classifier is composed with many memory cells. Because CSABAD is a two-class algorithm, we discuss the main approaches how to apply it to the multi-class problem, such as one-vs-one (OVO), one-vs-rest (OVR) and directed acyclic graph (DAG). Our experiments show that one-vs-one obtains the best classification performance on the 20_newsgroups dataset. In addition, the experiment results also show that it significantly outperforms Rocchio and Naive Bayes.
Keywords
artificial immune systems; pattern classification; text analysis; antibody density; clonal selection algorithm; density control mechanism; directed acyclic graph; immune algorithm; memory cells; multiclass text categorization; Animals; Automatic control; Classification algorithms; Immune system; Machine learning; Machine learning algorithms; Pattern recognition; Support vector machine classification; Support vector machines; Text categorization; antibody density; clonal selection; immune algorithm; multi class; text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Education Technology and Training, 2008. and 2008 International Workshop on Geoscience and Remote Sensing. ETT and GRS 2008. International Workshop on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3563-0
Type
conf
DOI
10.1109/ETTandGRS.2008.337
Filename
5070261
Link To Document