DocumentCode :
1877175
Title :
Compute Pairwise Manhattan Distance and Pearson Correlation Coefficient of Data Points with GPU
Author :
Chang, Dar-Jen ; Desoky, Ahmed H. ; Ouyang, Ming ; Rouchka, Eric C.
Author_Institution :
Comput. Eng. & Comput. Sci. Dept., Univ. of Louisville, Louisville, KY, USA
fYear :
2009
fDate :
27-29 May 2009
Firstpage :
501
Lastpage :
506
Abstract :
Graphics processing units (GPUs) are powerful computational devices tailored towards the needs of the 3-D gaming industry for high-performance, real-time graphics engines. Nvidia Corporation released a new generation of GPUs designed for general-purpose computing in 2006, and it released a GPU programming language called CUDA in 2007. The DNA microarray technology is a high throughput tool for assaying mRNA abundance in cell samples. In data analysis, scientists often apply hierarchical clustering of the genes, where a fundamental operation is to calculate all pairwise distances. If there are n genes, it takes O(n^2) time. In this work, GPUs and the CUDA language are used to calculate pairwise distances. For Manhattan distance, GPU/CUDA achieves a 40 to 90 times speed-up compared to the central processing unit implementation; for Pearson correlation coefficient, the speed-up is 28 to 38 times.
Keywords :
biology computing; computational complexity; computer games; computer graphics; coprocessors; correlation methods; data analysis; lab-on-a-chip; pattern clustering; software architecture; 3-D gaming industry; CUDA; DNA microarray technology; GPU programming language; Manhattan distance; Nvidia corporation; Pearson correlation coefficient; cell samples; central processing unit; computational devices; compute unified device architecture; data analysis; data points; general-purpose computing; genes hierarchical clustering; graphics processing units; high throughput tool; mRNA; pairwise distances calculation; real-time graphics engines; Bioinformatics; Central Processing Unit; Computer networks; Concurrent computing; DNA; Data analysis; Distributed computing; Graphics; Power engineering computing; Sequences; Parallel and distributed computation; hierarchical clustering; similarity and dissimilarity metrics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering, Artificial Intelligences, Networking and Parallel/Distributed Computing, 2009. SNPD '09. 10th ACIS International Conference on
Conference_Location :
Daegu
Print_ISBN :
978-0-7695-3642-2
Type :
conf
DOI :
10.1109/SNPD.2009.34
Filename :
5286618
Link To Document :
بازگشت