• DocumentCode
    592836
  • Title
    Using frequency distance filteration for reducing database search workload on GPU-based cloud service
  • Author
    Sheng-Ta Lee ; Chun-Yuan Lin ; Che Lun Hung ; Hsuan Ying Huang
  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Chang Gung Univ., Taoyuan, Taiwan
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    735
  • Lastpage
    740
  • Abstract
    The Smith-Waterman algorithm is the most widely used algorithm for analyzing the similarity between protein and DNA sequences, and its high sensitivity makes it suitable for database search. However, Smith-Waterman is still very time-consuming. CUDA programming can accelerate the computation by exploiting the massively parallel computing power of GPUs. In this paper, we propose an efficient frequency-based filter method: rather than only speeding up the Smith-Waterman comparison while still wasting computing resources on unnecessary comparisons, the filter discards those comparisons. We implemented the Smith-Waterman algorithm using techniques from earlier research and added our real-time filter method on graphics processing units to filter out unnecessary comparisons. We also designed a user-friendly interface to provide the service in a potential cloud computing environment. We chose two data sets, the H1N1 VH protein database and the Human protein database, and compared CUDA-SW with CUDA-SW plus the filter, which we call CUDA-SWf. CUDA-SWf obtains up to a 41% performance improvement by reducing unnecessary sequence alignments.
  • Keywords
    biology computing; cloud computing; database management systems; graphics processing units; information retrieval; parallel architectures; proteins; user interfaces; CUDA programming; CUDA-SWf; DNA sequences; GPU-based cloud service; H1N1 VH protein database; Smith-Waterman algorithm; cloud computing environment; database search workload reduction; frequency based filter method; frequency distance filteration; graphics processing units; human protein database; real-time filter method; sequence alignments; user friendly interface design; Algorithm design and analysis; Arrays; Databases; Filtering algorithms; Graphics processing units; Instruction sets; Proteins; CUDA; GPGPU; Smith-Waterman; alignment; filter; sequence;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cloud Computing Technology and Science (CloudCom), 2012 IEEE 4th International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4673-4511-8
  • Electronic_ISBN
    978-1-4673-4509-5
  • Type
    conf
  • DOI
    10.1109/CloudCom.2012.6427539
  • Filename
    6427539
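
The abstract describes a frequency-based filter that skips unnecessary Smith-Waterman comparisons but gives no formula or threshold. As a rough illustration only, the sketch below uses the commonly cited definition of frequency distance (the larger of the symbol surplus and the symbol deficit between two sequences, which is a lower bound on their edit distance) and runs on the CPU in plain Python. The function names and the `max_distance` threshold are hypothetical and are not taken from the paper; how the paper relates this distance to Smith-Waterman scores on the GPU is not stated in this record.

```python
from collections import Counter


def frequency_distance(a: str, b: str) -> int:
    """Frequency distance between two sequences.

    Assumption: this is the commonly used definition (a lower bound on
    edit distance), not necessarily the exact filter used in the paper.
    """
    counts_a, counts_b = Counter(a), Counter(b)
    # Counter subtraction keeps only positive differences.
    surplus = sum((counts_a - counts_b).values())  # symbols a has in excess of b
    deficit = sum((counts_b - counts_a).values())  # symbols b has in excess of a
    return max(surplus, deficit)


def filter_candidates(query: str, database: list[str], max_distance: int) -> list[str]:
    """Keep only database sequences worth aligning against the query.

    `max_distance` is a hypothetical threshold chosen for illustration.
    """
    return [seq for seq in database if frequency_distance(query, seq) <= max_distance]


if __name__ == "__main__":
    query = "ACGT"
    database = ["ACGA", "TGCA", "AAAAAAAA", "ACGTT"]
    # Sequences that survive the filter would then go on to the
    # (much more expensive) Smith-Waterman alignment step.
    print(filter_candidates(query, database, max_distance=1))
```

Because the frequency vectors ignore symbol order, the filter is cheap to evaluate but can only rule sequences out; any sequence that passes still needs a full alignment.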