• DocumentCode
    3485989
  • Title

    Ground-Truth Estimation in Multispectral Representation Space: Application to Degraded Document Image Binarization

  • Author

    Hedjam, Rachid ; Cheriet, Mohamed

  • Author_Institution
    Synchromedia Lab. for Multemedia Commun. in Telepresence, Montreal, QC, Canada
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    190
  • Lastpage
    194
  • Abstract
    Human ground-truthing is the manual labelling of samples (pixels for example) to generate reference data without any automatic algorithm help. Although a manual ground-truth is more accurate than a machine ground-truth, it still suffers from mislabeling and/or judgement errors. In this paper we propose a new method of ground-truth estimation using multispectral (MS) imaging representation space for the sake of document image binarization. Starting from the initial manual ground-truth, the proposed classification method aims to select automatically some samples with correct labels (well-labeled pixels) from each class for the training phase, then reassign new labels to the document image pixels. The classification scheme is based on the cooperation of multiple classifiers under some constraints. A real data set of MS historical document images and their ground-truth is created to demonstrate the effectiveness of the proposed method of ground-truth estimation.
  • Keywords
    document image processing; image classification; image representation; image resolution; MS historical document images; MS imaging representation space; degraded document image binarization; document image pixels; ground-truth estimation; human ground-truthing; image classification method; multispectral imaging representation space; Estimation; Image color analysis; Imaging; Labeling; Manuals; Text analysis; Training; Document image analysis; Document image binarization; Ground-truth estimation; Historical document images; Multispectral document imaging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.45
  • Filename
    6628610