DocumentCode :
3664187
Title :
Perfect Hashing Structures for Parallel Similarity Searches
Author :
Tuan Tu Tran;Mathieu Giraud;Jean-Stéphane Varré
Author_Institution :
Johannes Gutenberg-Univ. Mainz Mainz, Mainz, Germany
fYear :
2015
fDate :
5/1/2015 12:00:00 AM
Firstpage :
332
Lastpage :
341
Abstract :
Seed-based heuristics have proved to be efficient for studying similarity between genetic databases with billions of base pairs. This paper focuses on algorithms and data structures for the filtering phase in seed-based heuristics, with an emphasis on efficient parallel GPU/many cores implementation. We propose a 2-stage index structure which is based on neighborhood indexing and perfect hashing techniques. This structure performs a filtering phase over the neighborhood regions around the seeds in constant time and avoid as much as possible random memory accesses and branch divergences. Moreover, it fits particularly well on parallel SIMD processors, because it requires intensive but homogeneous computational operations. Using this data structure, we developed a fast and sensitive Open CL prototype read mapper.
Keywords :
"Indexing","Graphics processing units","Bioinformatics","Data structures","Pattern matching"
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International
Type :
conf
DOI :
10.1109/IPDPSW.2015.105
Filename :
7284329
Link To Document :
بازگشت