DocumentCode :
1632890
Title :
Pre-Processing of Degraded Printed Documents by Non-local Means and Total Variation
Author :
Likforman-Sulem, Laurence ; Darbon, Jérôme ; Smith, Elisa H Barney
Author_Institution :
Telecom ParisTech, Paris, France
fYear :
2009
Firstpage :
758
Lastpage :
762
Abstract :
We compare in this study two image restoration approaches for the pre-processing of printed documents:namely the Non-local Means filter and a total variation minimization approach. We apply these two approaches to printed document sets from various periods,and we evaluate their effectiveness through character recognition performance using an open source OCR. Our results show that for each document set, one or both pre-processing methods improve character recog-nition accuracy over recognition without preprocessing. Higher accuracies are obtained with Non-local Means when characters have a low level of degradation since they can be restored by similar neighboring parts of non-degraded characters. The Total Variation approach is more effective when characters are highly degraded and can only be restored through modeling instead of using neighboring data.
Keywords :
document image processing; image restoration; minimisation; optical character recognition; character recognition; degraded printed document preprocessing; image restoration; nonlocal means filter; open source OCR; total variation minimization approach; Background noise; Character recognition; Context modeling; Degradation; Filtering; Image restoration; Image segmentation; Ink; Optical character recognition software; TV; Document Image restoration; degraded documents; non-local means; total variation; variational approach;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
ISSN :
1520-5363
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2009.210
Filename :
5277501
Link To Document :
بازگشت