• DocumentCode
    2022525
  • Title

    Decompose Document Image Using Integer Linear Programming

  • Author

    Gao, Dashan ; Wang, Yizhou ; Hindi, Haitham ; Do, Minh

  • Author_Institution
    Univ. of California, San Diego
  • Volume
    1
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    397
  • Lastpage
    401
  • Abstract
    Document decomposition is a basic but crucial step for many document related applications. This paper proposes a novel approach to decompose document images into zones. It first generates overlapping zone hypotheses based on generic visual features. Then, each candidate zone is evaluated quantitatively by a learned generative zone model. We formulate the zone inference problem into a constrained optimization problem, so as to select an optimal set of non-overlapping zones that cover a given document image. The experimental results demonstrate that the proposed method is very robust to document structure variation and noise.
  • Keywords
    document image processing; integer programming; linear programming; constrained optimization problem; document image decomposition; document structure variation; integer linear programming; learned generative zone model; zone inference problem; Aggregates; Application software; Atomic layer deposition; Constraint optimization; Drives; Image decomposition; Image representation; Image segmentation; Integer linear programming; Noise robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.2007.4378739
  • Filename
    4378739