DocumentCode :
1665923
Title :
Design and Implementation of Network Transfer Protocol for Big Genomic Data
Author :
Aledhari, Mohammed ; Saeed, Fahad
Author_Institution :
Dept. of Comput. Sci., Western Michigan Univ., Kalamazoo, MI, USA
fYear :
2015
Firstpage :
281
Lastpage :
288
Abstract :
Genomic data is growing exponentially due to next generation sequencing technologies (NGS) and their ability to produce massive amounts of data in a short time. NGS technologies generate big genomic data that needs to be exchanged between different locations efficiently and reliably. The current network transfer protocols rely on Transmission Control Protocol (TCP) or User Data gram Protocol (UDP) protocols, ignoring data size and type. Universal application layer protocols such as HTTP are designed for wide variety of data types and are not particularly efficient for genomic data. Therefore, we present a new data-aware transfer protocol for genomic-data that increases network throughput and reduces latency, called Genomic Text Transfer Protocol (GTTP). In this paper, we design and implement a new network transfer protocol for big genomic DNA dataset that relies on the Hypertext Transfer Protocol (HTTP). Modification to content-encoding of HTTP has been done that would transfer big genomic DNA datasets using machine-to-machine (M2M) and client(s)-server topologies. Our results show that our modification to HTTP reduces the transmitted data by 75% of original data and still be able to regenerate the data at the client side for bioinformatics analysis. Consequently, the transfer of data using GTTP is shown to be much faster (about 8 times faster than HTTP) when compared with regular HTTP.
Keywords :
Big Data; DNA; bioinformatics; client-server systems; genomics; transport protocols; GTTP; HTTP; M2M topologies; NGS; TCP; UDP; big genomic DNA dataset; bioinformatics analysis; client-server topologies; data-aware transfer protocol; genomic text transfer protocol; hypertext transfer protocol; machine-to-machine topologies; network transfer protocol; next generation sequencing technologies; transmission control protocol; universal application layer protocols; user datagram protocol; Bioinformatics; Biological cells; DNA; Encoding; Genomics; Protocols; Servers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (BigData Congress), 2015 IEEE International Congress on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4673-7277-0
Type :
conf
DOI :
10.1109/BigDataCongress.2015.47
Filename :
7207231
Link To Document :
بازگشت