DocumentCode :
244597
Title :
Optimized ASIP architecture for compressed BWT-indexed search in bioinformatics applications
Author :
Sebastio, Nuno ; Flores, Paulo ; Roma, Nuno
Author_Institution :
Inst. Super. Tecnico, Univ. de Lisboa, Lisbon, Portugal
fYear :
2014
fDate :
21-25 July 2014
Firstpage :
527
Lastpage :
534
Abstract :
Compressed indexes are adopted by a vast set of bioinformatics applications that deal with extremely large datasets, mainly due to the inherently high memory requirements of uncompressed alternatives. However, the additional computational overhead that is imposed by the usage of such indexes makes them harder to implement in embedded computational platforms, such as biochips, with strict processing and power restrictions. Furthermore, compressed indexes are often characterized by a significant usage of bit-level operations, some of which are not commonly available on General Purpose Processors (GPPs). To circumvent this limitation, an Application-Specific Instruction-set Processor (ASIP) architecture is proposed to accelerate the processing of biological sequences (e.g., alignment, mapping, etc.) using compressed full-text indexes based on the Burrows-Wheeler Transform (BWT). The proposed processor was built over a RISC micro-architecture and extends the Xilinx MicroBlaze ISA with additional bit-level operations, especially tailored for compressed indexes. When used to perform search operations over the considered compressed index, the proposed architecture provides a reduction of the number of required instructions by about one half. Furthermore, when prototyped on a Xilinx Virtex-7 FPGA, the ASIP proved to offer an overall speedup between 3.1x and 4.5x for the execution of a single threaded operation. To ensure a further processing scalability, the proposed ASIP was designed in order to be easily used as the basic processing unit of multi-core systems, especially tuned for the parallel processing of massive datasets of biological reads.
Keywords :
bioinformatics; data compression; general purpose computers; instruction sets; multiprocessing systems; optimisation; parallel processing; ASIP architecture optimization; Burrows-Wheeler transform; GPPs; application-specific instruction-set processor; bioinformatics applications; biological sequence processing; compressed BWT-indexed search; general purpose processors; multicore systems; parallel processing; Bioinformatics; Data structures; Indexes; Lead; Memory management; Program processors; Application-Specific Instruction-Set Processor; Compressed text indexes; DNA alignment; Heuristic algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing & Simulation (HPCS), 2014 International Conference on
Conference_Location :
Bologna
Print_ISBN :
978-1-4799-5312-7
Type :
conf
DOI :
10.1109/HPCSim.2014.6903731
Filename :
6903731
Link To Document :
بازگشت