• DocumentCode
    2938140
  • Title

    Galileo: A Framework for Distributed Storage of High-Throughput Data Streams

  • Author

    Malensek, Matthew ; Pallickara, Sangmi Lee ; Pallickara, Shrideep

  • Author_Institution
    Dept. of Comput. Sci., Colorado State Univ., Fort Collins, CO, USA
  • fYear
    2011
  • fDate
    5-8 Dec. 2011
  • Firstpage
    17
  • Lastpage
    24
  • Abstract
    We describe the design of a high-throughput storage system, Galileo, for data streams generated in observational settings. The shared-nothing architecture in Galileo supports incremental assimilation of nodes, while accounting for heterogeneity in their capabilities, to cope with data volumes. To achieve efficient storage and retrievals of data, Galileo accounts for the geospatial and chronological characteristics of such time-series observational data streams. Our benchmarks demonstrate that Galileo supports high-throughput storage and efficient retrievals of specific portions of large datasets while supporting different types of queries.
  • Keywords
    distributed processing; storage management; Galileo; distributed storage; high-throughput data streams; high-throughput storage system; shared-nothing architecture; Computer architecture; File systems; Geospatial analysis; Indexes; Runtime; Temperature measurement; commodity clusters; data storage; distributed systems; observational streams; query evaluations; scale-out architectures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Utility and Cloud Computing (UCC), 2011 Fourth IEEE International Conference on
  • Conference_Location
    Victoria, NSW
  • Print_ISBN
    978-1-4577-2116-8
  • Type

    conf

  • DOI
    10.1109/UCC.2011.13
  • Filename
    6123476