• DocumentCode
    2731783
  • Title

    Parallel Implementation of a Bioinformatics Pipeline for the Design of Pathogen Diagnostic Assays

  • Author

    Satya, Ravi Vijaya ; Kumar, Kamal ; Zavaljevski, Nela ; Reifman, Jaques

  • Author_Institution
    Telemedicine & Adv. Technol. Res. Center, US Army Med. Res. & Materiel Command (MRMC), Fort Detrick, MD, USA
  • fYear
    2009
  • fDate
    15-18 June 2009
  • Firstpage
    213
  • Lastpage
    218
  • Abstract
    The genomes of hundreds of pathogens and their near neighbors are now available and many more are being sequenced. With the availability of this genome information, sequence-based pathogen identification has become an increasingly important tool for clinical diagnostics and environmental monitoring of biological threat agents. Chief among sequence-based identification tools are DNA microarrays, which have the ability to test for thousands of pathogens in a single diagnostic test. The design of microarray diagnostic assays involves the identification of short DNA sequences unique to a pathogen or groups of pathogens, where these unique sequences, or “fingerprints” (also referred to as probes) are used to identify the pathogens. To design pathogen fingerprints, we developed TOFI (Tool for Oligonucleotide Fingerprint Identification), a high performance computing software pipeline that designs microarray probes for multiple related pathogens in a single run. The TOFI pipeline is extremely efficient in designing microarray fingerprints for multiple pathogens. Parallel implementation of computationally expensive specificity analysis of the designed fingerprints drastically reduces the overall execution time of the software. Comprehensive performance analysis shows that TOFI achieves super-linear speedup for up to 74 processors. A Web-based user interface, developed using the User Interface Toolkit, provides easy access to the pipeline. Using 74 processors, TOFI took approximately nine hours to design 5,015 in-silico probes for eight Burkholderia genomes with a combined size of more than 50 million base pairs. Experimental validation of these probes with various Burkholderia genomes showed that nearly 80% of the designed fingerprints identify the intended targets.
  • Keywords
    Internet; bioinformatics; lab-on-a-chip; patient diagnosis; user interfaces; Burkholderia genomes; DNA microarrays; DNA sequences; Web-based user interface; bioinformatics pipeline; biological threat agents; clinical diagnostics; environmental monitoring; genome information; microarray diagnostic assays; microarray probes; pathogen diagnostic assays; pathogen fingerprints; sequence-based pathogen identification; tool for oligonucleotide fingerprint identification; user interface toolkit; Databases; Fingerprint recognition; Genomics; Pathogens; Pipelines; Probes; Program processors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    DoD High Performance Computing Modernization Program Users Group Conference (HPCMP-UGC), 2009
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-1-4244-5768-7
  • Type

    conf

  • DOI
    10.1109/HPCMP-UGC.2009.36
  • Filename
    5729467