• DocumentCode
    2736961
  • Title

    Efficient pairwise statistical significance estimation for local sequence alignment using GPU

  • Author

    Zhang, Yuhong ; Misra, Sanchit ; Honbo, Daniel ; Agrawal, Ankit ; Liao, Wei-keng ; Choudhary, Alok

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • fYear
    2011
  • fDate
    3-5 Feb. 2011
  • Firstpage
    226
  • Lastpage
    231
  • Abstract
    Pairwise statistical significance has been found to be quite accurate in identifying related sequences (homologs), which is a key step in numerous bioinformatics applications. However, it is computational and data intensive, particularly for a large amount of sequence data. To prevent it from becoming a performance bottleneck, we resort to Graphics Processing Units (GPUs) for accelerating the computation. In this paper, we present a GPU memory-access optimized implementation for a pairwise statistical significance estimation algorithm. By exploring the algorithm´s data access characteristics, we developed a tile-based scheme that can produce a contiguous memory accesses pattern to GPU global memory and sustain a large number of threads to achieve a high GPU occupancy. Our experimental results present both single- and multi-pair statistical significance estimations. The performance evaluation was carried out on an NVIDIA Telsa C2050 GPU. We observe more than 180× end-to-end speedup over the CPU implementation on an Intel© Core™ i7 processor. The proposed memory access optimizations and efficient framework are also applicable to many other sequence comparison based applications, such as DNA sequence mapping and database search.
  • Keywords
    DNA; bioinformatics; molecular biophysics; statistical analysis; DNA sequence; GPU; NVIDIA Telsa C2050 GPU; bioinformatics; graphics processing units; local sequence alignment; memory access; pairwise statistical significance estimation; Databases; Estimation; Graphics processing unit; Instruction sets; Kernel; Layout; Parallel processing; GPU; pairwise sequence alignment; statistical significance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    978-1-61284-851-8
  • Type

    conf

  • DOI
    10.1109/ICCABS.2011.5729885
  • Filename
    5729885