DocumentCode :
153365
Title :
Combining Focus Measure Operators to Predict OCR Accuracy in Mobile-Captured Document Images
Author :
Rusinol, Marcal ; Chazalon, Joseph ; Ogier, Jean-Marc
Author_Institution :
L3i Lab., Univ. de La Rochelle, La Rochelle, France
fYear :
2014
fDate :
7-10 April 2014
Firstpage :
181
Lastpage :
185
Abstract :
Mobile document image acquisition is a new trend raising serious issues in business document processing workflows. Such digitization procedure is unreliable, and integrates many distortions which must be detected as soon as possible, on the mobile, to avoid paying data transmission fees, and losing information due to the inability to re-capture later a document with temporary availability. In this context, out-of-focus blur is major issue: users have no direct control over it, and it seriously degrades OCR recognition. In this paper, we concentrate on the estimation of focus quality, to ensure a sufficient legibility of a document image for OCR processing. We propose two contributions to improve OCR accuracy prediction for mobile-captured document images. First, we present 24 focus measures, never tested on document images, which are fast to compute and require no training. Second, we show that a combination of those measures enables state-of-the art performance regarding the correlation with OCR accuracy. The resulting approach is fast, robust, and easy to implement in a mobile device. Experiments are performed on a public dataset, and precise details about image processing are given.
Keywords :
business data processing; data acquisition; document image processing; mobile computing; optical character recognition; OCR accuracy prediction; OCR processing; OCR recognition; business document processing workflows; focus measure operators; focus quality estimation; image processing; mobile device; mobile-captured document image acquisition; out-of-focus blur; sufficient document image legibility; temporary document availability; Accuracy; Atmospheric measurements; Engines; Histograms; Mobile communication; Optical character recognition software; Particle measurements; Camera-acquired document images; OCR prediction; Quality Assessment; focus measures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location :
Tours
Print_ISBN :
978-1-4799-3243-6
Type :
conf
DOI :
10.1109/DAS.2014.11
Filename :
6830994
Link To Document :
بازگشت