DocumentCode :
3695199
Title :
Localized document image change detection
Author :
Rajiv Jain;David Doermann
Author_Institution :
Institute for Advanced Computer Studies, University of Maryland, College Park, USA
fYear :
2015
Firstpage :
786
Lastpage :
790
Abstract :
Given two versions of a document image, the goal of document image change detection is to automatically determine exactly what content was added, deleted or modified. Typically, one would accomplish this by first performing Optical Character Recognition (OCR) on the two documents and then performing a “diff” to identify the changes. However, this approach can fail due to OCR errors, poor segmentation, or the inability to handle graphical content. We compare the OCR baseline with two techniques based on SIFT features that detect changes in the image at the word level. The first approach performs the “diff” on SIFT features extracted from the center line of the text image. The second approach performs a segmentation free alignment of text blocks using dense SIFT to address the more general cases where segmentation fails or graphical objects are modified. Results on two experimental datasets show the improvement of the segmentation free approach over the baseline approach.
Keywords :
"Optical character recognition software","Image segmentation","Image recognition","Robustness","Electronic mail","Lighting"
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
Type :
conf
DOI :
10.1109/ICDAR.2015.7333869
Filename :
7333869
Link To Document :
بازگشت