DocumentCode
3339747
Title
Feature selection based on random forest and application in correlation analysis of symptom and disease
Author
Hu Xue-qin ; Cui Meng ; Chen Bing
Author_Institution
Inst. of Inf. on Traditional Chinese Med., China Acad. of Chinese Med. Sci., Beijing, China
Volume
1
fYear
2009
fDate
14-16 Aug. 2009
Firstpage
120
Lastpage
124
Abstract
A set of simple, rational diagnosis mode is the effective premise for intelligent diagnosis model. In this paper, selected the important symptoms of "five endogenous pathogens (FEP)" and measured these symptoms\´ contribution degree to FEP were main contents of this paper. Focused on the disease characteristics of "FEP", we introduced the method of random forest (RF), and used it to build feature selection evaluation criteria, then proved the effectiveness of this method. On this basis, the article also explored the effective way to build an intelligent diagnosis model for "FEP". Comparative experiment shown that RF model was superior in the diagnosis performance than the multi-classification support vector machine (SVM) classifier, and proven it to be an effective and high-performance "FEP" diagnosis model.
Keywords
diseases; feature extraction; medical computing; patient diagnosis; pattern classification; FEP disease characteristics; SVM classifier comparison; disease-symptom correlation analysis; feature selection; five endogenous pathogens; intelligent diagnosis model; multiclassification support vector machine; random forest; rational diagnosis; Clinical diagnosis; Cognitive science; Diseases; Learning systems; Machine intelligence; Medical diagnostic imaging; Pathogens; Radio frequency; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
IT in Medicine & Education, 2009. ITIME '09. IEEE International Symposium on
Conference_Location
Jinan
Print_ISBN
978-1-4244-3928-7
Electronic_ISBN
978-1-4244-3930-0
Type
conf
DOI
10.1109/ITIME.2009.5236450
Filename
5236450
Link To Document