Title :
Fast and efficient document image clean up and binarization based on retinex theory
Author :
Wagdy, M. ; Faye, Ibrahima ; Rohaya, D.
Author_Institution :
Centre of Intell. Signal & Imaging Res. (CISIR), Univ. Teknol. Petronas, Tronoh, Malaysia
Abstract :
Conversion from gray scale or color document image into binary image is the main and important step in most of optical character recognition (OCR) systems and document analysis. Most of the document images after digitization often suffer from poor contrast, noise, uniform lighting, and shadow. Clean up and binarization is an active subject in image processing, which addresses these problems. Most of the previous binarization methods that depend on local thresholding consume more time. The other methods that depend on global threshold are fast but don´t work well when the document image is degraded. In this paper we present fast and efficient document image clean up and binarization method based on retinex theory and global threshold. The proposed method is fast and produces high quality results compared to the previous works.
Keywords :
document image processing; image colour analysis; optical character recognition; OCR system; binarization method; binary image; color document image; document analysis; document image binarization; document image clean up; gray scale document image; image processing; local thresholding; optical character recognition; retinex theory; Degradation; Filtering theory; Lighting; Noise; Optical character recognition software; Text analysis; Binarization; Retinex theory; Thresholding;
Conference_Titel :
Signal Processing and its Applications (CSPA), 2013 IEEE 9th International Colloquium on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4673-5608-4
DOI :
10.1109/CSPA.2013.6530014