Title :
Diagnostic prediction of multi-class cancer using SVM and nearest neighbor classifier
Author :
Kar, Soummya ; Das Sharma, Kaushik ; Maitra, Madhubanti
Author_Institution :
Dept. of Electr. Eng., Future Inst. of Eng. & Manage., Kolkata, India
fDate :
Jan. 31 2014-Feb. 2 2014
Abstract :
Precise diagnosis of four heterogeneous childhood cancers, namely, neuroblastoma, non-Hodgkin lymphoma, rhabdomyosarcoma and Ewing sarcoma is crucial because they present a similar histology of small round blue cell tumors (SRBCTs) and frequently leads to misdiagnosis. However, due to small number of samples compared to very large number of genes in microarray gene expression data, it is hard to identify a small subset of relevant genes that can classify these four subgroups of childhood cancers with high accuracy. Therefore, in this paper, we have utilized t-test to rank all the genes according to their importance. Support vector machine (SVM) with different kernels and a simple 1-nearest neighbor (1-NN) classifier have been used to perform the classification task. Results demonstrate that the method could find very few numbers of genes for the diagnostic prediction of cancer subgroups.
Keywords :
cancer; lab-on-a-chip; medical diagnostic computing; patient diagnosis; pattern classification; support vector machines; tumours; 1-NN classifier; 1-nearest neighbor classifier; Ewing sarcoma; SRBCTs; SVM; childhood cancer classification; heterogeneous childhood cancer diagnosis; microarray gene expression data; multiclass cancer diagnostic prediction; neuroblastoma; nonHodgkin lymphoma; rhabdomyosarcoma; small round blue cell tumors; support vector machine; t-test; Accuracy; Cancer; Gene expression; Kernel; Niobium; Support vector machines; Training; 1-nearest neighbor; Cancer subgroups; T-test; identification of relevant Genes; support vector machine;
Conference_Titel :
Control, Instrumentation, Energy and Communication (CIEC), 2014 International Conference on
Conference_Location :
Calcutta
DOI :
10.1109/CIEC.2014.6959167