DocumentCode
3060019
Title
Fuzzy Rule Based Document Image Segmentation for Component Labeling
Author
Kapoor, Aditi ; Pandey, Parul ; Biswas, K.K.
Author_Institution
Amar Nath & Shashi Khosla Sch. of Inf. Technol., Indian Inst. of Technol., New Delhi, India
fYear
2011
fDate
15-17 Dec. 2011
Firstpage
11
Lastpage
14
Abstract
In this paper, we propose a fuzzy rule based method for extensive segmentation of documents, with the view of labeling various components like page header, title, figure and tables, column divider, text etc. The fuzzy rules are learned using evolutionary computation, based on labeled document images serving as ground truth. The proposed fuzzy rules exploit the placement heuristics among the various components in a technical paper. A genetic algorithm involving individual fitness functions of all components selects the best set of rules. The effectiveness of the proposed method is validated through a case study involving 300 documents. It is also shown that this approach can be extended to segmentation of magazine pages.
Keywords
document image processing; fuzzy set theory; image segmentation; knowledge based systems; column divider; component labeling; figure; fuzzy rule based document image segmentation; ground truth; magazine pages; page header; tables; title; Biological cells; Equations; Genetic algorithms; Image segmentation; Labeling; Layout; Text analysis; Document image; chromosomes; connected components; fuzzy rules; genetic algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), 2011 Third National Conference on
Conference_Location
Hubli, Karnataka
Print_ISBN
978-1-4577-2102-1
Type
conf
DOI
10.1109/NCVPRIPG.2011.10
Filename
6132989
Link To Document