Title of article
Gene expression data analysis of human lymphoma using support vector machines and output coding ensembles
Author/Authors
Valentini، نويسنده , , Giorgio، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2002
Pages
24
From page
281
To page
304
Abstract
The large amount of data generated by DNA microarrays was originally analysed using unsupervised methods, such as clustering or self-organizing maps. Recently supervised methods such as decision trees, dot-product support vector machines (SVM) and multi-layer perceptrons (MLP) have been applied in order to classify normal and tumoural tissues. We propose methods based on non-linear SVM with polynomial and Gaussian kernels, and output coding (OC) ensembles of learning machines to separate normal from malignant tissues, to classify different types of lymphoma and to analyse the role of sets of coordinately expressed genes in carcinogenic processes of lymphoid tissues. Using gene expression data from “Lymphochip”, a specialised DNA microarray developed at Stanford University School of Medicine, we show that SVM can correctly separate normal from tumoural tissues, and OC ensembles can be successfully used to classify different types of lymphoma. Moreover, we identify a group of coordinately expressed genes related to the separation of two distinct subgroups inside diffuse large B-cell lymphoma (DLBCL), validating a previous Alizadeh’s hypothesis about the existence of two distinct diseases inside DLBCL.
Keywords
Gene expression data analysis , Support Vector Machines , DNA microarrays , Output coding ensembles of learning machines
Journal title
Artificial Intelligence In Medicine
Serial Year
2002
Journal title
Artificial Intelligence In Medicine
Record number
1835948
Link To Document