• DocumentCode
    2489961
  • Title

    Dynamic filters selection for textual document image binarization

  • Author

    Cecotti, Hubert ; Belad, A.

  • Author_Institution
    Inst. of Autom. (IAT), Univ. of Bremen, Bremen
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    For a document class, one challenge in document binarization is to automatically find a set of techniques, which are adapted to the different degradation level of the images. It is important to know the methods to use and where they can be applied advantageously. A multi-classifiers solution is presented for pixel classification. These classifiers act as filters and are used for binarization. The technique starts by clustering close pixels by K-means. A classifier, which corresponds to a supervised neural network, is dedicated to each cluster. They are trained according to a binarized image where its pixels are weighted function to erosion transformation effects. The presented method is compared to classical binarization techniques in the literature. Its effect on the commercial OCR performance reaches a gain from 0.16% for Finereader7 and 1.06% for Omnipage14 for the recognition rate.
  • Keywords
    document image processing; image classification; learning (artificial intelligence); neural nets; Finereader7; K-means; Omnipagel4; commercial OCR performance; degradation level; dynamic filters selection; erosion transformation effects; multiclassifiers solution; pixel classification; supervised neural network; textual document image binarization; Automation; Degradation; Distributed control; Filters; Image recognition; Image restoration; Neural networks; Optical character recognition software; Performance gain; Pixel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761842
  • Filename
    4761842