• DocumentCode
    2426261
  • Title

    Document Image Segmentation as a Spectral Partitioning Problem

  • Author

    Dasigi, Praveen ; Jain, Raman ; Jawahar, C.V.

  • Author_Institution
    Center for Visual Inf. Technol., IIIT Hyderabad, Hyderabad
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    305
  • Lastpage
    312
  • Abstract
    State of art document segmentation algorithms employ ad hoc solutions which use some document properties and iteratively segment the document image. These solutions need to be adapted frequently and sometimes fail to perform well for complex scripts. This calls for a generalized solution that achieves a one shot segmentation that is globally optimal. This paper describes one such solution-based on the optimization problem of spectral partitioning which makes the decision of proper segmentation based on the spectral properties of the pairwise similarity matrix. The solution described in the paper is shown to be general, global and closed form. The claims have been demonstrated on 142 page images from a Telugu book, in a script set in both poetry and prose layouts. This particular class of scripts has been proved to be challenging for the existing state of the art algorithms, where the proposed solution achieves significant results.
  • Keywords
    document image processing; image segmentation; matrix algebra; optimisation; document image segmentation; document properties; optimization problem; pairwise similarity matrix; spectral partitioning problem; Algorithm design and analysis; Art; Computer graphics; Computer vision; Image segmentation; Iterative algorithms; Natural languages; Nearest neighbor searches; Optical character recognition software; Partitioning algorithms; document segmentation; spectral graph partitioning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, Graphics & Image Processing, 2008. ICVGIP '08. Sixth Indian Conference on
  • Conference_Location
    Bhubaneswar
  • Print_ISBN
    978-0-7695-3476-3
  • Electronic_ISBN
    978-0-7695-3476-3
  • Type

    conf

  • DOI
    10.1109/ICVGIP.2008.96
  • Filename
    4756086