• DocumentCode
    1919734
  • Title

    Deduplication and compression techniques in cloud design

  • Author

    Upadhyay, Amrita ; Balihalli, Pratibha R. ; Ivaturi, Shashibhushan ; Rao, Shrisha

  • Author_Institution
    Electron. City, Int. Inst. of Inf. Technol. Bangalore, Bangalore, India
  • fYear
    2012
  • fDate
    19-22 March 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Our approach to deduplication and compression in cloud computing aims at reduction in storage space and bandwidth usage during file transfers. The design depends on multiple metadata structures for deduplication. Only a copy of the duplicate files is retained while others are deleted. The existence of duplicate files is determined from the metadata. The files are clustered into bins depending on their size. They are then segmented, deduplicated, compressed and stored. Binning restricts the number of segments and their sizes so that it is optimum for each file size. When the user requests a file, compressed segments of the file are sent over the network along with the file-to-segment mapping. These are the uncompressed and combined to create a complete file, hence minimizing bandwidth requirements.
  • Keywords
    cloud computing; data compression; meta data; storage management; bandwidth usage reduction; binning; cloud computing; cloud design; compression techniques; deduplication techniques; duplicate files; file compression; file deduplication; file segmentation; file size; file storage; file transfers; file-to-segment mapping; multiple metadata structures; storage space reduction; Arrays; Bandwidth; Cloud computing; Digital audio players; Electromagnetic compatibility; Eucalyptus; SHA-1; algorithm; bandwidth; binning; cloud; compression; deduplication; hash value; network; segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems Conference (SysCon), 2012 IEEE International
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4673-0748-2
  • Type

    conf

  • DOI
    10.1109/SysCon.2012.6189472
  • Filename
    6189472