DocumentCode :
248619
Title :
A deep learning approach to document image quality assessment
Author :
Le Kang ; Peng Ye ; Yi Li ; Doermann, David
Author_Institution :
Univ. of Maryland, College Park, MD, USA
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
2570
Lastpage :
2574
Abstract :
This paper proposes a deep learning approach for document image quality assessment. Given a noise corrupted document image, we estimate its quality score as a prediction of OCR accuracy. First the document image is divided into patches and non-informative patches are sifted out using Otsu´s binarization technique. Second, quality scores are obtained for all selected patches using a Convolutional Neural Network (CNN), and the patch scores are averaged over the image to obtain the document score. The proposed CNN contains two layers of convolution, location blind max-min pooling, and Rectified Linear Units in the fully connected layers. Experiments on two document quality datasets show our method achieved the state of the art performance.
Keywords :
convolution; document image processing; image denoising; learning (artificial intelligence); neural nets; CNN; OCR accuracy; Otsu binarization technique; convolution layer; convolutional neural network; deep learning approach; document image quality assessment; location blind max-min pooling; noise corrupted document image; optical character recognition; quality score; rectified linear units; Accuracy; Convolution; Image quality; Neural networks; Optical character recognition software; System-on-chip; Training; Convolutional neural networks; document; image quality;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
Type :
conf
DOI :
10.1109/ICIP.2014.7025520
Filename :
7025520
Link To Document :
بازگشت