DocumentCode
2489961
Title
Dynamic filters selection for textual document image binarization
Author
Cecotti, Hubert ; Belad, A.
Author_Institution
Inst. of Autom. (IAT), Univ. of Bremen, Bremen
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
For a document class, one challenge in document binarization is to automatically find a set of techniques, which are adapted to the different degradation level of the images. It is important to know the methods to use and where they can be applied advantageously. A multi-classifiers solution is presented for pixel classification. These classifiers act as filters and are used for binarization. The technique starts by clustering close pixels by K-means. A classifier, which corresponds to a supervised neural network, is dedicated to each cluster. They are trained according to a binarized image where its pixels are weighted function to erosion transformation effects. The presented method is compared to classical binarization techniques in the literature. Its effect on the commercial OCR performance reaches a gain from 0.16% for Finereader7 and 1.06% for Omnipage14 for the recognition rate.
Keywords
document image processing; image classification; learning (artificial intelligence); neural nets; Finereader7; K-means; Omnipagel4; commercial OCR performance; degradation level; dynamic filters selection; erosion transformation effects; multiclassifiers solution; pixel classification; supervised neural network; textual document image binarization; Automation; Degradation; Distributed control; Filters; Image recognition; Image restoration; Neural networks; Optical character recognition software; Performance gain; Pixel;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761842
Filename
4761842
Link To Document