DocumentCode
627942
Title
Acyclic Identification of Aptamer from Over-Represented Libraries Using Hash Functions
Author
Yiou Xiao ; Mehrotra, Kishan G. ; Mohan, Chilukuri K. ; Borer, Phillip N. ; Allis, Damian G.
Author_Institution
Dept. of EECS, Syracuse Univ., Syracuse, NY, USA
fYear
2013
fDate
5-7 April 2013
Firstpage
179
Lastpage
179
Abstract
In recent years, with the advent of fast sequencing technology, genomic databases have been growing rapidly. Researchers in the bioinformatics field need faster and more accurate tools to analyze these very large data sets effectively. In aptamer search, the goal is to find the DNA sequences that are over-represented relative to random background libraries on the same chip. Hash functions are widely used in substring comparison, sequence alignment, and clustering tools. We have developed a lightweight tool that uses hash functions to reduce the size of the genomic data and conducts k-neighbor searches around the centroid sequence. This greatly improves search efficiency compared with the existing tool. Furthermore, calculating the k-neighbor hash values decreases the overhead of searching for mutants. In a dataset of 1 million sequences, the program accurately counted the frequency of the human alpha-thrombin sequence and found the mutant versions of the target sequence in less than 40 seconds, whereas the existing method takes 8280 seconds (2 hours 18 minutes).
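The abstract does not specify the hash function or how neighbors are defined, so the following is only a minimal sketch of the general idea under stated assumptions: each read is packed into an integer at two bits per base, reads are counted by hash, and the hashes of all sequences within Hamming distance k of a centroid (substitutions only) are generated directly from the centroid's hash. The names `encode`, `neighbor_hashes`, and `count_with_mutants` are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations, product

# Assumed 2-bit encoding; the paper's actual hash function is not given.
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}


def encode(seq):
    """Pack a DNA string into an integer, two bits per base."""
    h = 0
    for base in seq:
        h = (h << 2) | CODE[base]
    return h


def neighbor_hashes(seq, k):
    """Hashes of every sequence within Hamming distance k of seq.

    Only substitutions are considered (an assumption); each neighbor's
    hash is derived by overwriting 2-bit fields of the centroid's hash.
    """
    n = len(seq)
    center = encode(seq)
    hashes = {center}
    for d in range(1, k + 1):
        for positions in combinations(range(n), d):
            # At each chosen position, try the three other bases.
            options = [[b for b in range(4) if b != CODE[seq[p]]]
                       for p in positions]
            for replacement in product(*options):
                h = center
                for p, b in zip(positions, replacement):
                    shift = 2 * (n - 1 - p)
                    h = (h & ~(3 << shift)) | (b << shift)
                hashes.add(h)
    return hashes


def count_with_mutants(reads, centroid, k=1):
    """Return (exact matches, mutant matches within distance k)."""
    # Keep only reads of the centroid's length made of A, C, G, T, so the
    # 2-bit encoding is collision-free for the reads being compared.
    usable = (r for r in reads
              if len(r) == len(centroid) and set(r) <= CODE.keys())
    counts = Counter(encode(r) for r in usable)
    exact = counts[encode(centroid)]
    total = sum(counts[h] for h in neighbor_hashes(centroid, k))
    return exact, total - exact


# Toy reads for illustration only; not data from the paper.
reads = ["ACGT", "ACGT", "ACGA", "TCGT", "TTTT", "ACG", "ACGN"]
exact, mutants = count_with_mutants(reads, "ACGT", k=1)
```

Because the neighbor hashes are computed once from the centroid, finding mutants takes one dictionary lookup per neighbor rather than a comparison against every read, which is one plausible reading of how the k-neighbor hash values reduce the mutant-search overhead.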
Keywords
DNA; bioinformatics; genomics; molecular configurations; organic compounds; DNA sequences; acyclic aptamer identification; aptamer search; bioinformatic field; centroid sequence; clustering tools; fast sequencing technology; genomic data size; genomic database; gigantic data sets; hash functions; human alpha-Thrombin sequence; k-neighbor hash values; k-neighbor searches; light-weighted tool; random background libraries; sequence alignment; Bioinformatics; Biomedical engineering; DNA; Educational institutions; Genomics; Libraries; Sequential analysis; Aptamer; DNA; Hash; Overrepresented library;
fLanguage
English
Publisher
IEEE
Conference_Titel
Bioengineering Conference (NEBEC), 2013 39th Annual Northeast
Conference_Location
Syracuse, NY
ISSN
2160-7001
Print_ISBN
978-1-4673-4928-4
Type
conf
DOI
10.1109/NEBEC.2013.2
Filename
6574416