• DocumentCode
    721311
  • Title

    Clustered Outband Deduplication on Primary Data

  • Author

    Agrawal, Archana Satynarayan ; Malhotra, Jyoti

  • Author_Institution
    Dept. of Inf. Technol., MIT Coll. of Eng., Pune, India
  • fYear
    2015
  • fDate
    26-27 Feb. 2015
  • Firstpage
    446
  • Lastpage
    450
  • Abstract
    Data deduplication is a technique that identifies duplicate data, stores only one copy of all redundant data, and creates a link to that copy, so that when a user accesses the data, the access is resolved through that link. Out-band deduplication is performed after the backup has been written. Today, data deduplication has become a necessary and critical component of primary storage. In this paper, we discuss the technique of out-band data deduplication on primary storage workloads, such as data in common day-to-day use and user directories. The paper also discusses an implementation of out-band deduplication for primary storage that aims to improve RAM performance, throughput, and the efficiency of in-memory lookups. To improve the lookup performance of the system, we use a cuckoo filter instead of traditional multi-layer filters.
  • Keywords
    storage management; RAM performance; clustered outband deduplication; cuckoo filter; multilayer traditional filters; out-band data deduplication technique; primary data deduplication; primary storage; Fingerprint recognition; Indexes; Matched filters; Random access memory; Servers; Throughput; Backup; cache; chunking; lookups; metadata; post-process; primary data and storage;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computing Communication Control and Automation (ICCUBEA), 2015 International Conference on
  • Conference_Location
    Pune
  • Type

    conf

  • DOI
    10.1109/ICCUBEA.2015.93
  • Filename
    7155886