DocumentCode
1919734
Title
Deduplication and compression techniques in cloud design
Author
Upadhyay, Amrita ; Balihalli, Pratibha R. ; Ivaturi, Shashibhushan ; Rao, Shrisha
Author_Institution
Electron. City, Int. Inst. of Inf. Technol. Bangalore, Bangalore, India
fYear
2012
fDate
19-22 March 2012
Firstpage
1
Lastpage
6
Abstract
Our approach to deduplication and compression in cloud computing aims to reduce storage space and bandwidth usage during file transfers. The design relies on multiple metadata structures for deduplication. Only one copy of each duplicated file is retained; the others are deleted. The existence of duplicate files is determined from the metadata. Files are clustered into bins according to their size, then segmented, deduplicated, compressed, and stored. Binning restricts the number of segments and their sizes so that they are optimal for each file size. When the user requests a file, the compressed segments of the file are sent over the network along with the file-to-segment mapping. These are then uncompressed and combined to create the complete file, thereby minimizing bandwidth requirements.
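The pipeline described in the abstract (bin by size, segment, deduplicate, compress, store, then reassemble on request) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the bin thresholds, segment sizes, the `SegmentStore` class, and the use of zlib are all hypothetical choices made for the example. SHA-1 appears in the record's keywords, so it is used here as the segment hash.

```python
import hashlib
import zlib

# Hypothetical bins: (maximum file size in bytes, segment size in bytes).
# The record does not give the actual thresholds used in the paper.
BINS = [(64 * 1024, 4 * 1024), (1024 * 1024, 64 * 1024)]
DEFAULT_SEGMENT_SIZE = 1024 * 1024


def segment_size_for(file_size):
    """Pick a segment size from the bin that the file size falls into."""
    for max_size, seg_size in BINS:
        if file_size <= max_size:
            return seg_size
    return DEFAULT_SEGMENT_SIZE


class SegmentStore:
    """Illustrative store: one compressed copy per unique segment."""

    def __init__(self):
        self.segments = {}   # SHA-1 hex digest -> compressed segment bytes
        self.file_map = {}   # file name -> ordered list of segment digests

    def put(self, name, data):
        size = segment_size_for(len(data))
        digests = []
        for start in range(0, len(data), size):
            chunk = data[start:start + size]
            digest = hashlib.sha1(chunk).hexdigest()
            # Deduplicate: compress and store a segment only the first time.
            if digest not in self.segments:
                self.segments[digest] = zlib.compress(chunk)
            digests.append(digest)
        self.file_map[name] = digests

    def fetch(self, name):
        """Return the file-to-segment mapping plus the compressed segments."""
        digests = self.file_map[name]
        payload = {d: self.segments[d] for d in digests}
        return digests, payload


def reassemble(digests, payload):
    """Client side: decompress each segment and join them in mapping order."""
    return b"".join(zlib.decompress(payload[d]) for d in digests)
```

In this sketch a segment shared by several files, or repeated within one file, is stored and transmitted only once, which is where the storage and bandwidth savings would come from.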
Keywords
cloud computing; data compression; meta data; storage management; bandwidth usage reduction; binning; cloud computing; cloud design; compression techniques; deduplication techniques; duplicate files; file compression; file deduplication; file segmentation; file size; file storage; file transfers; file-to-segment mapping; multiple metadata structures; storage space reduction; Arrays; Bandwidth; Cloud computing; Digital audio players; Electromagnetic compatibility; Eucalyptus; SHA-1; algorithm; bandwidth; binning; cloud; compression; deduplication; hash value; network; segmentation;
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems Conference (SysCon), 2012 IEEE International
Conference_Location
Vancouver, BC
Print_ISBN
978-1-4673-0748-2
Type
conf
DOI
10.1109/SysCon.2012.6189472
Filename
6189472