DocumentCode
2938140
Title
Galileo: A Framework for Distributed Storage of High-Throughput Data Streams
Author
Malensek, Matthew ; Pallickara, Sangmi Lee ; Pallickara, Shrideep
Author_Institution
Dept. of Comput. Sci., Colorado State Univ., Fort Collins, CO, USA
fYear
2011
fDate
5-8 Dec. 2011
Firstpage
17
Lastpage
24
Abstract
We describe the design of a high-throughput storage system, Galileo, for data streams generated in observational settings. The shared-nothing architecture in Galileo supports incremental assimilation of nodes, while accounting for heterogeneity in their capabilities, to cope with growing data volumes. To achieve efficient storage and retrieval of data, Galileo accounts for the geospatial and chronological characteristics of such time-series observational data streams. Our benchmarks demonstrate that Galileo supports high-throughput storage and efficient retrieval of specific portions of large datasets while supporting different types of queries.
Keywords
distributed processing; storage management; Galileo; distributed storage; high-throughput data streams; high-throughput storage system; shared-nothing architecture; Computer architecture; File systems; Geospatial analysis; Indexes; Runtime; Temperature measurement; commodity clusters; data storage; distributed systems; observational streams; query evaluations; scale-out architectures;
fLanguage
English
Publisher
IEEE
Conference_Titel
Utility and Cloud Computing (UCC), 2011 Fourth IEEE International Conference on
Conference_Location
Melbourne, VIC, Australia
Print_ISBN
978-1-4577-2116-8
Type
conf
DOI
10.1109/UCC.2011.13
Filename
6123476