• DocumentCode
    172978
  • Title

    Using Elasticity to Improve Inline Data Deduplication Storage Systems

  • Author

    Yufeng Wang; C.C. Tan; Ningfang Mi

  • Author_Institution
    Temple Univ., Philadelphia, PA, USA
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    785
  • Lastpage
    792
  • Abstract
    Elasticity is the ability to scale computing resources such as memory on demand, and is one of the main advantages of using cloud computing services. With the increasing popularity of cloud-based storage, it is natural that more deduplication-based storage systems will migrate to the cloud. Existing deduplication systems, however, do not adequately take advantage of elasticity. In this paper, we illustrate how to use elasticity to improve deduplication-based systems, and propose EAD (elasticity-aware deduplication), an indexing algorithm that uses the ability to dynamically increase memory resources to improve overall deduplication performance. Our experimental results indicate that EAD detects more than 98% of all duplicate data while consuming less than 5% of the expected memory space. It also achieves four times the deduplication efficiency of the state-of-the-art sampling technique while using less than half the memory.
  • Keywords
    cloud computing; data handling; database indexing; storage management; EAD; cloud based storage; cloud computing services; elasticity aware deduplication; indexing algorithm; inline data deduplication storage systems; memory on-demand; Elasticity; Estimation; Heuristic algorithms; Indexes; Memory management; Random access memory; Servers;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cloud Computing (CLOUD), 2014 IEEE 7th International Conference on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5062-1
  • Type

    conf

  • DOI
    10.1109/CLOUD.2014.109
  • Filename
    6973815