• DocumentCode
    2235447
  • Title
    A duplicate image deduplication approach via Haar wavelet technology
  • Author
    Ming Chen ; Yang Wang ; Xiaoxiang Zou ; Shupeng Wang ; Guangjun Wu
  • Author_Institution
    Nat. Eng. Lab. for Disaster Backup & Recovery, Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2012
  • fDate
    Oct. 30 2012-Nov. 1 2012
  • Firstpage
    624
  • Lastpage
    628
  • Abstract
    Traditional deduplication technologies can eliminate only exactly identical images and cannot handle duplicate images that are not exactly the same. To solve this problem, we propose a duplicate image deduplication approach based on the Haar wavelet. The proposed approach employs Haar wavelet decomposition to extract feature vectors of images and uses the Manhattan distance between feature vectors to judge the similarity of images. If two images are similar, we extract part of the data from the feature vectors of the corresponding images to create collections, and decide whether to deduplicate by comparing the number of elements the collections have in common against a threshold. The experimental results show that the proposed approach can achieve a higher deduplication ratio and deduplication accuracy when suitable thresholds are set.
  • Keywords
    Haar transforms; data compression; feature extraction; image coding; wavelet transforms; Haar wavelet decomposition; Haar wavelet technology; Manhattan distance; deduplication accuracy; deduplication ratio; duplicate image deduplication approach; image feature vector extraction; Accuracy; Data mining; Feature extraction; Image resolution; Optimization; Redundancy; Vectors; Centroid selection; Haar wavelet; Image deduplication; Optimization accuracy
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4673-1855-6
  • Type
    conf
  • DOI
    10.1109/CCIS.2012.6664249
  • Filename
    6664249
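
  • Illustrative sketch (not part of the original record)
    The abstract describes a two-stage check: Haar wavelet feature extraction with a Manhattan distance test, followed by a comparison of collections built from part of each feature vector. The pure-Python sketch below is one possible reading of those steps, not the authors' implementation. Every name and parameter in it (`haar_approximation`, `feature_set`, `dist_threshold`, `count`, `step`, `min_common`) is hypothetical, and the choices of using the low-frequency subband as the feature vector and of quantizing coefficients to build the collections are assumptions, since the record does not specify them.

```python
import math
from typing import List, Set, Tuple

Image = List[List[float]]  # grayscale image; side lengths divisible by 2**levels


def haar_step(values: List[float]) -> List[float]:
    """One 1-D Haar step: pairwise averages, then pairwise half-differences."""
    half = len(values) // 2
    avgs = [(values[2 * i] + values[2 * i + 1]) / 2 for i in range(half)]
    diffs = [(values[2 * i] - values[2 * i + 1]) / 2 for i in range(half)]
    return avgs + diffs


def haar_approximation(image: Image, levels: int = 1) -> Image:
    """Low-frequency subband after `levels` 2-D Haar decomposition steps."""
    current = [list(row) for row in image]
    for _ in range(levels):
        rows = [haar_step(row) for row in current]            # transform rows
        cols = [haar_step(list(col)) for col in zip(*rows)]   # transform columns
        transformed = [list(row) for row in zip(*cols)]       # transpose back
        h, w = len(transformed) // 2, len(transformed[0]) // 2
        current = [row[:w] for row in transformed[:h]]        # keep top-left quadrant
    return current


def feature_vector(image: Image, levels: int = 1) -> List[float]:
    """Assumption: the flattened approximation subband serves as the feature vector."""
    return [v for row in haar_approximation(image, levels) for v in row]


def manhattan(a: List[float], b: List[float]) -> float:
    """Manhattan (L1) distance between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def feature_set(vec: List[float], count: int, step: float) -> Set[Tuple[int, int]]:
    """Assumption: a collection holds (index, quantized value) for the first `count` coefficients."""
    return {(i, math.floor(v / step)) for i, v in enumerate(vec[:count])}


def is_duplicate(img_a: Image, img_b: Image, dist_threshold: float = 5.0,
                 count: int = 4, step: float = 4.0, min_common: int = 3) -> bool:
    """Stage 1: Manhattan distance test. Stage 2: count of common collection elements."""
    fa, fb = feature_vector(img_a), feature_vector(img_b)
    if manhattan(fa, fb) > dist_threshold:
        return False  # not similar enough to be a candidate
    common = feature_set(fa, count, step) & feature_set(fb, count, step)
    return len(common) >= min_common


original = [[10, 10, 20, 20], [10, 10, 20, 20], [30, 30, 40, 40], [30, 30, 40, 40]]
slightly_changed = [[12, 10, 20, 20], [10, 10, 20, 20], [30, 30, 40, 40], [30, 30, 40, 40]]
unrelated = [[200] * 4 for _ in range(4)]

print(is_duplicate(original, slightly_changed))
print(is_duplicate(original, unrelated))
```

    The threshold defaults here are placeholders chosen for the toy 4x4 inputs. The abstract states only that suitable thresholds must be set, so real values would need to be tuned on actual image data.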