Title :
Document Image Segmentation as a Spectral Partitioning Problem
Author :
Dasigi, Praveen ; Jain, Raman ; Jawahar, C.V.
Author_Institution :
Center for Visual Inf. Technol., IIIT Hyderabad, Hyderabad
Abstract :
State of art document segmentation algorithms employ ad hoc solutions which use some document properties and iteratively segment the document image. These solutions need to be adapted frequently and sometimes fail to perform well for complex scripts. This calls for a generalized solution that achieves a one shot segmentation that is globally optimal. This paper describes one such solution-based on the optimization problem of spectral partitioning which makes the decision of proper segmentation based on the spectral properties of the pairwise similarity matrix. The solution described in the paper is shown to be general, global and closed form. The claims have been demonstrated on 142 page images from a Telugu book, in a script set in both poetry and prose layouts. This particular class of scripts has been proved to be challenging for the existing state of the art algorithms, where the proposed solution achieves significant results.
Keywords :
document image processing; image segmentation; matrix algebra; optimisation; document image segmentation; document properties; optimization problem; pairwise similarity matrix; spectral partitioning problem; Algorithm design and analysis; Art; Computer graphics; Computer vision; Image segmentation; Iterative algorithms; Natural languages; Nearest neighbor searches; Optical character recognition software; Partitioning algorithms; document segmentation; spectral graph partitioning;
Conference_Titel :
Computer Vision, Graphics & Image Processing, 2008. ICVGIP '08. Sixth Indian Conference on
Conference_Location :
Bhubaneswar
Print_ISBN :
978-0-7695-3476-3
Electronic_ISBN :
978-0-7695-3476-3
DOI :
10.1109/ICVGIP.2008.96