Title :
Reliability of Geo-replicated Cloud Storage Systems
Author :
Iliadis, Ilias ; Sotnikov, Dmitry ; Ta-Shma, Paula ; Venkatesan, V.
Abstract :
Network bandwidth between sites is typically scarcer than bandwidth within a site in geo-replicated cloud storage systems, and can potentially be a bottleneck for recovery operations. We study the reliability of geo-replicated cloud storage systems, taking into account the different bandwidths within a site and between sites. We consider a new recovery scheme called staged rebuild and compare it with both a direct scheme and a scheme known as intelligent rebuild. To assess the reliability gains achieved by these schemes, we develop an analytical model that incorporates various relevant aspects of storage systems, such as bandwidths, latent sector errors, and failure distributions. The model applies in the context of OpenStack Swift, a widely deployed cloud storage system. Under certain practical system configurations, we establish that order-of-magnitude improvements in mean time to data loss (MTTDL) can be achieved using these schemes.
Keywords :
cloud computing; replicated databases; MTTDL; OpenStack Swift; analytical model; direct scheme; failure distributions; geo-replicated cloud storage system reliability; intelligent rebuild scheme; latent sector errors; magnitude improvement-in-mean time-to-data loss; network bandwidth; recovery operations; reliability gain assessment; staged rebuild recovery scheme; Artificial intelligence; Bandwidth; Correlation; Redundancy; Reliability theory; cloud storage; geo-replication; recovery
Conference_Title :
Dependable Computing (PRDC), 2014 IEEE 20th Pacific Rim International Symposium on
Conference_Location :
Singapore
Print_ISBN :
978-1-4799-6473-4
DOI :
10.1109/PRDC.2014.30