• DocumentCode
    3003276
  • Title

    Robust unsupervised segmentation of degraded document images with topic models

  • Author

    Burns, Timothy J ; Corso, Jason J.

  • Author_Institution
    Comput. Sci. & Eng., SUNY at Buffalo, Buffalo, NY, USA
  • fYear
    2009
  • fDate
    20-25 June 2009
  • Firstpage
    1287
  • Lastpage
    1294
  • Abstract
    Segmentation of document images remains a challenging vision problem. Although document images have a structured layout, capturing enough of it for segmentation can be difficult. Most current methods combine text extraction and heuristics for segmentation, but text extraction is prone to failure and measuring accuracy remains a difficult challenge. Furthermore, when presented with significant degradation many common heuristic methods fall apart. In this paper, we propose a Bayesian generative model for document images which seeks to overcome some of these drawbacks. Our model automatically discovers different regions present in a document image in a completely unsupervised fashion. We attempt no text extraction, but rather use discrete patch-based codebook learning to make our probabilistic representation feasible. Each latent region topic is a distribution over these patch indices. We capture rough document layout with an MRF Potts model. We take an analysis by synthesis approach to examine the model, and provide quantitative segmentation results on a manually labeled document image data set. We illustrate our model´s robustness by providing results on a highly degraded version of our test set.
  • Keywords
    Bayes methods; document image processing; image segmentation; unsupervised learning; Bayesian generative model; MRF Potts model; discrete patch based codebook learning; document image data set; document images Segmentation; text extraction; Bayesian methods; Computer science; Current measurement; Degradation; Image analysis; Image segmentation; Pipelines; Robustness; Smoothing methods; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-3992-8
  • Type

    conf

  • DOI
    10.1109/CVPR.2009.5206606
  • Filename
    5206606