DocumentCode
2426261
Title
Document Image Segmentation as a Spectral Partitioning Problem
Author
Dasigi, Praveen ; Jain, Raman ; Jawahar, C.V.
Author_Institution
Center for Visual Inf. Technol., IIIT Hyderabad, Hyderabad
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
305
Lastpage
312
Abstract
State of art document segmentation algorithms employ ad hoc solutions which use some document properties and iteratively segment the document image. These solutions need to be adapted frequently and sometimes fail to perform well for complex scripts. This calls for a generalized solution that achieves a one shot segmentation that is globally optimal. This paper describes one such solution-based on the optimization problem of spectral partitioning which makes the decision of proper segmentation based on the spectral properties of the pairwise similarity matrix. The solution described in the paper is shown to be general, global and closed form. The claims have been demonstrated on 142 page images from a Telugu book, in a script set in both poetry and prose layouts. This particular class of scripts has been proved to be challenging for the existing state of the art algorithms, where the proposed solution achieves significant results.
Keywords
document image processing; image segmentation; matrix algebra; optimisation; document image segmentation; document properties; optimization problem; pairwise similarity matrix; spectral partitioning problem; Algorithm design and analysis; Art; Computer graphics; Computer vision; Image segmentation; Iterative algorithms; Natural languages; Nearest neighbor searches; Optical character recognition software; Partitioning algorithms; document segmentation; spectral graph partitioning;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision, Graphics & Image Processing, 2008. ICVGIP '08. Sixth Indian Conference on
Conference_Location
Bhubaneswar
Print_ISBN
978-0-7695-3476-3
Electronic_ISBN
978-0-7695-3476-3
Type
conf
DOI
10.1109/ICVGIP.2008.96
Filename
4756086
Link To Document