Title :
A variable length hash method for faster short read mapping on FPGA
Author :
Yoko Sogabe;Tsutomu Maruyama
Author_Institution :
Systems and Information Engineering, University of Tsukuba, 1-1-1 Ten-nou-dai Ibaraki 305-8573 JAPAN
Abstract :
Short read mapping is the process of aligning short reads, which are fixed-length fragments of the target genome, to a given reference genome in order to identify mutations in the target genome. Because of the rapid development of Next Generation Sequencing (NGS) technologies, faster short read mapping is required. In this paper, we propose a variable length hash method to further accelerate FPGA short read mapping systems. In hash-based short read mapping algorithms, a fixed-length substring of each short read, called a seed, is used as the key. However, because the human genome is highly non-uniform, many different seeds are mapped into the same hash slots, and many fruitless key comparisons are performed. To equalize the slot sizes, we propose an optimized hash function that changes its bit masks adaptively. This approach can improve the performance of any FPGA short read mapping system based on hash functions. On the Xilinx XC7VX690T and XC6VLX240T, the comparison performance of our FPGA system can be improved twofold, and the total performance exceeds that of existing FPGA systems.
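The idea of equalizing slot sizes by adapting the bit mask can be illustrated with a minimal software sketch. It is not the authors' hash function or their FPGA design: the 2-bit base encoding, the prefix-style masks, the splitting rule, and the names `encode`, `build_fixed`, `build_variable`, `lookup`, `min_bases` and `max_slot` are all assumptions made for illustration. Each seed is first keyed by a few bases, and only slots that grow beyond `max_slot` are rebuilt with a wider mask.

```python
from collections import defaultdict

# Assumed 2-bit encoding of the four bases (illustrative, not from the paper).
BASE_BITS = {"A": 0, "C": 1, "G": 2, "T": 3}


def encode(seed):
    """Pack a DNA string into an integer, 2 bits per base, first base in the low bits."""
    value = 0
    for i, base in enumerate(seed):
        value |= BASE_BITS[base] << (2 * i)
    return value


def mask_for(key_bases):
    """Bit mask that keeps the first key_bases bases of an encoded seed."""
    return (1 << (2 * key_bases)) - 1


def build_fixed(reference, seed_len, key_bases):
    """Fixed-length hashing: every seed is keyed by the same number of bases."""
    table = defaultdict(list)
    for pos in range(len(reference) - seed_len + 1):
        code = encode(reference[pos:pos + seed_len])
        table[code & mask_for(key_bases)].append(pos)
    return table


def build_variable(reference, seed_len, min_bases, max_slot):
    """Variable-length hashing: widen the mask only for slots larger than max_slot."""
    codes = {pos: encode(reference[pos:pos + seed_len])
             for pos in range(len(reference) - seed_len + 1)}
    slots = {}                          # (key_bases, masked code) -> positions
    work = [(min_bases, list(codes))]   # groups that still need to be keyed
    while work:
        key_bases, positions = work.pop()
        groups = defaultdict(list)
        for pos in positions:
            groups[codes[pos] & mask_for(key_bases)].append(pos)
        for key, members in groups.items():
            if len(members) > max_slot and key_bases < seed_len:
                work.append((key_bases + 1, members))  # split with one more base
            else:
                slots[(key_bases, key)] = members
    return slots


def lookup(slots, seed, min_bases):
    """Try successively wider masks until a stored slot matches the seed."""
    code = encode(seed)
    for key_bases in range(min_bases, len(seed) + 1):
        hit = slots.get((key_bases, code & mask_for(key_bases)))
        if hit is not None:
            return hit
    return []
```

In this sketch a slot that holds only identical seeds (for example a long run of `A`) cannot be split further, so `max_slot` is a target rather than a hard bound. A hardware design would also need a fixed-latency way to select the mask, which this sketch does not attempt to model.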
Keywords :
"Indexes","Genomics","Bioinformatics","Field programmable gate arrays","Random access memory","Accuracy","Computers"
Conference_Title :
Field Programmable Logic and Applications (FPL), 2015 25th International Conference on
DOI :
10.1109/FPL.2015.7293938