• DocumentCode
    3487798
  • Title

    Unsupervised Ensemble of Experts (EoE) Framework for Automatic Binarization of Document Images

  • Author

    Moghaddam, Reza Farrahi ; Moghaddam, Faraz Fatemi ; Cheriet, Mohamed

  • Author_Institution
    Synchromedia Lab., Ecole de Technol. Super., Montreal, QC, Canada
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    703
  • Lastpage
    707
  • Abstract
    In recent years, a large number of binarization methods have been developed, with varying performance generalization and strength against different benchmarks. In this work, to leverage on these methods, an ensemble of experts (EoE) framework is introduced, to efficiently combine the outputs of various methods. The proposed framework offers a new selection process of the binarization methods, which are actually the experts in the ensemble, by introducing three concepts: confident ness, endorsement and schools of experts. The framework, which is highly objective, is built based on two general principles: (i) consolidation of saturated opinions and (ii) identification of schools of experts. After building the endorsement graph of the ensemble for an input document image based on the confident ness of the experts, the saturated opinions are consolidated, and then the schools of experts are identified by thresholding the consolidated endorsement graph. A variation of the framework, in which no selection is made, is also introduced that combines the outputs of all experts using endorsement-dependent weights. The EoE framework is evaluated on the set of participating methods in the H-DIBCO´12 contest and also on an ensemble generated from various instances of grid-based Sauvola method with promising performance.
  • Keywords
    document image processing; graph theory; unsupervised learning; EoE framework; automatic binarization; endorsement graph; endorsement-dependent weights; grid-based Sauvola method; input document image; unsupervised ensemble of experts framework; Decision making; Educational institutions; Feature extraction; Laplace equations; PSNR; Robustness; Visualization; Binarization; Confidentness; Consolidation; Endorsement; Endorsement Graph; Ensemble of Experts; Expert Selection; School of Experts;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.144
  • Filename
    6628709