DocumentCode :
1762646
Title :
coreSNP: Parallel Processing of Microarray Data
Author :
Guzzi, Pietro H. ; Agapito, Giuseppe ; Cannataro, Mario
Author_Institution :
Dept. of Med. & Surg. Sci., Magna Graecia Univ. of Catanzaro, Catanzaro, Italy
Volume :
63
Issue :
12
fYear :
2014
fDate :
Dec. 2014
Firstpage :
2961
Lastpage :
2974
Abstract :
The availability of high-throughput technologies, such as next generation sequencing and microarray, and the diffusion of genomics studies to large populations are producing an increasing amount of experimental data. In particular, pharmacogenomics studies the impact of genetic variation on drug response in patients and correlates gene expression or single nucleotide polymorphisms (SNPs) with the toxicity or efficacy of a drug, with the aim to improve drug therapy with respect to the patients´ genotype ensuring maximum efficacy with minimal adverse effects. However, the storage, preprocessing, and analysis of experimental data are becoming a main bottleneck in the pharmacogenomics analysis pipeline, due to the increasing number of genes and patients investigated. This paper presents a new parallel software tool named coreSNP for the parallel preprocessing and statistical analysis of DMET (Drug Metabolism Enzymes and Transporters) SNP microarray data produced by Affymetrix for pharmacogenomics studies. The scalable multi-threaded implementation of coreSNP allows to handle the huge volumes of experimental pharmacogenomics data in a very efficient way, while its easy to use graphical user interface and its ability to annotate significant SNPs allow biologists to interpret the results easily. Performance evaluation conducted using real datasets shows good speed-up and scalability and effective response times.
Keywords :
bioinformatics; drugs; enzymes; genetics; genomics; graphical user interfaces; health care; information retrieval; lab-on-a-chip; multi-threading; statistical analysis; Affymetrix; DMET SNP microarray data; Drug Metabolism Enzymes and Transporters; SNP annotation; coreSNP parallel software tool; drug response; drug therapy improvement; drug toxicity; experimental data analysis; experimental data preprocessing; experimental data storage; gene expression; genetic variation; genomics diffusion; graphical user interface; high-throughput technologies; maximum drug efficacy; microarray data; minimal adverse effects; next generation sequencing; parallel processing; patient genotype; performance evaluation; pharmacogenomics analysis pipeline; response times; scalable multithreaded implementation; single-nucleotide polymorphisms; statistical analysis; Bioinformatics; DNA; Drugs; Genomics; Parallel processing; Statistical analysis; Throughput; Bioinformatics (genome or protein) databases; distributed programming; distributed systems; health care; healthcare; medical information systems; statistical software;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2013.176
Filename :
6587035
Link To Document :
بازگشت