Title :
Learning feature selection for medical databases
Author :
Skrypnik, Irina ; Terziyan, Vagan ; Puuronen, Seppo ; Tsymbal, Alexey
Author_Institution :
Metaintelligence Lab., Kharkov State Tech. Univ. of Radio-Electron., Ukraine
Abstract :
In many application areas, as in medicine, knowledge is discovered from large databases using data mining methods. It is evident that quite often even with static data mining methods the accuracy of the results of data mining is better if fewer features are selected or if the classification task is divided into sub tasks guided by the heterogeneity of the feature space. On the other hand, one of the most important directions in the improvement of data mining and knowledge discovery is to integrate multiple classification techniques included in an ensemble of classifiers. We use two variations of an advanced dynamic integration technique, which first build an ensemble of classifiers containing base classifiers based on the subsets of the original feature set, then evaluate the competence areas of the base classifiers inside the application domain, and finally select a classifier to produce the final classification result dynamically. The technique is evaluated on two data sets taken from the UCI machine learning repository and a comparison of the results show that our dynamic integration technique with classifiers based on reduced feature sets is able to produce more accurate results than C4.5 with the whole set of features on one data set and that our method outperforms the cross-validation majority technique and is comparable with weighted voting on some data sets
Keywords :
data mining; learning (artificial intelligence); medical information systems; UCI machine learning repository; data mining; dynamic integration; feature selection learning; knowledge discovery; medical databases; Spatial databases;
Conference_Titel :
Computer-Based Medical Systems, 1999. Proceedings. 12th IEEE Symposium on
Conference_Location :
Stamford, CT
Print_ISBN :
0-7695-0234-2
DOI :
10.1109/CBMS.1999.781248