DocumentCode :
2145068
Title :
Graph Clustering-Based Ensemble Method for Handwritten Text Line Segmentation
Author :
Manohar, Vasant ; Vitaladevuni, Shiv N. ; Cao, Huaigu ; Prasad, Rohit ; Natarajan, Prem
Author_Institution :
Speech, Language, & Multimedia Bus. Unit, Raytheon BBN Technol., Cambridge, MA, USA
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
574
Lastpage :
578
Abstract :
Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the recent advances in ensemble-based combination for pattern recognition problems, it is possible to improve the segmentation performance by combining the outputs from different line finding methods. In this paper, we propose a novel graph clustering-based approach to combine the output of an ensemble of text line segmentation algorithms. A weighted undirected graph is constructed with nodes corresponding to connected components and edge connecting pairs of connected components. Text line segmentation is then posed as the problem of minimum cost partitioning of the nodes in the graph such that each cluster corresponds to a unique line in the document image. Experimental results on a challenging Arabic field dataset using the ensemble method shows a relative gain of 18% in the F1 score over the best individual method within the ensemble.
Keywords :
document image processing; graph theory; handwritten character recognition; pattern recognition; text analysis; Arabic field dataset; document image; ensemble-based combination; graph clustering-based approach; graph clustering-based ensemble method; handwritten text line segmentation; hndwritten text line segmentation; line finding methods; pattern recognition problems; real-world data; segmentation performance; text line segmentation algorithms; weighted undirected graph; Clustering algorithms; Conferences; Corporate acquisitions; Handwriting recognition; Image edge detection; Image segmentation; Measurement; ensemble method; graph clustering; handwriting; text line segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.121
Filename :
6065376
Link To Document :
بازگشت