DocumentCode :
1640573
Title :
HT-RLS: High-Throughput Web Tool for Analysis of DNA Microarray Data Using RLS classifiers
Author :
Meo, P. D´ Onorio De ; Carrabino, D. ; Antonio, M. D´ ; Sanna, N. ; Castrignanò, T. ; Maglietta, R. ; Addabbo, A. D´ ; Liuni, S. ; Mignone, F. ; Pesole, G. ; Ancona, N.
Author_Institution :
CASPUR, Rome
fYear :
2008
Firstpage :
747
Lastpage :
752
Abstract :
Gene expression from DNA microarray data offers biologists and pathologists the possibility to deal with the problem of disease (e. g. cancer) diagnosis and prognosis from a quantitative point of view. Microarray data provide a snapshot of the molecular status of a sample of cells in a given tissue, returning the expression levels of thousands of genes simultaneously. Several mathematical methods from learning theory, such as Regularized Least Squares (RLS) classifiers or Support Vector Machines (SVM), have been extensively adopted to classify gene expression data. These methods can be useful to answer some relevant questions such as 1) what is the right amount of data to build an accurate classifier? 2) How many and which genes are correlated with a specific pathology? The computational analysis to statistically estimate the accuracy of the chosen models is particularly time consuming, burning several days of CPU time and without high-throughput or high- performance tools becomes practically unfeasible to obtain results in a reasonable time for biomedical community. We have implemented an independent, flexible and scalable platform, for a high-throughput large-scale microarray gene expression data analysis and classification, based on R tool for statistical computing. It integrates databases and computational intensive algorithms, based on RLS classifiers and a powerful web client for data training and graphical visualization of predicted results. Our platform provides statistically significant answers to the study of the gene expression by means of microarray data and supplying useful information to relevant questions in the diagnosis and prognosis of diseases in a reasonable time. The web resource is available free of charge for academic and non-profit institutions.
Keywords :
DNA; Internet; biology computing; data visualisation; pattern classification; DNA microarray data analysis; HT-RLS; RLS classifiers; Web resource; computational analysis; data classification; graphical visualization; high-throughput Web tool; high-throughput microarray gene expression data analysis; large-scale microarray gene expression data analysis; learning theory; regularized least squares classifiers; support vector machines; Biomedical computing; Cancer; DNA; Diseases; Gene expression; Least squares methods; Machine learning; Resonance light scattering; Support vector machine classification; Support vector machines; RLS classifiers; high throughput; microarray; service-oriented architectures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing and the Grid, 2008. CCGRID '08. 8th IEEE International Symposium on
Conference_Location :
Lyon
Print_ISBN :
978-0-7695-3156-4
Electronic_ISBN :
978-0-7695-3156-4
Type :
conf
DOI :
10.1109/CCGRID.2008.108
Filename :
4534298
Link To Document :
بازگشت