Title :
Complexity measures of supervised classifications tasks: A case study for cancer gene expression data
Author :
De Souto, Marcilio C P ; Lorena, Ana C. ; Spolaôr, Newton ; Costa, Ivan G.
Author_Institution :
Dept. of Inf. & Appl. Math., Fed. Univ. of Rio Grande do Norte, Rio Grande, Brazil
Abstract :
Machine Learning algorithms have been widely used for gene expression data classification, despite the fact that these data have often intrinsic limitations, such as high dimensionality and a small number of examples. Few studies try to characterize to which extent these aspects can influence the performance of the classification models induced. In this paper we compute different measures characterizing the complexity of gene expression data sets for cancer diagnosis. We then investigate how these measures relate to the classification performances achieved by support vector machines, a popular Machine Learning technique usually employed in the analysis of gene expression data. The results obtained indicate that some of the complexity indices utilized are indeed successful in explaining the difficulty involved in the classification of cancer gene expression data.
Keywords :
cancer; classification; genetics; learning (artificial intelligence); medical computing; patient diagnosis; support vector machines; cancer diagnosis; cancer gene expression data; complexity measures; machine learning; supervised classifications tasks; support vector machines; Cancer; Complexity theory; Error analysis; Gene expression; Indexes; Noise; Support vector machines;
Conference_Titel :
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6916-1
DOI :
10.1109/IJCNN.2010.5596305