Title :
A Novel Preprocessing Method for Hectography Prints Based on Independent Component Analysis
Author :
Kurbiel, Thomas ; Konya, Iuliu ; Eickeler, Stefan
Author_Institution :
Fraunhofer Inst. for Intell. Anal. & Inf. Syst. (IAIS), St. Augustin, Germany
Abstract :
Archives and cultural facilities consists of vast spectra of different document classes, many of which are not encountered today anymore. The digitization therefore calls for image enhancement and preprocessing solutions so far not required and hence unsolved. A prominent document class in this context is hectography, which was an inexpensive printing and duplication method, widely used throughout the 20th century. The major challenge with hectography is poor contrast on the one side and multiple degradation effects on the other side. In this paper a novel preprocessing method for hectography duplicates is proposed which leads to better Optical Character Recognition (OCR) results compared to traditional methods that operate on grayscale images. The proposed method is based on a linear mixing model used in independent component analysis. The problem of unwanted Gaussian noise components is considered as well.
Keywords :
Gaussian noise; document image processing; image enhancement; independent component analysis; optical character recognition; document class digitization; grayscale images; hectography duplicates; hectography print preprocessing method; image enhancement; independent component analysis; linear mixing model; optical character recognition; unwanted Gaussian noise component; Degradation; Image color analysis; Independent component analysis; Noise; Optical character recognition software; Principal component analysis; Random variables; OCR; extraction; hectography; image enhancement; independent component analysis;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.231