Title :
Coupled Snakelet Model for Curled Textline Segmentation of Camera-Captured Document Images
Author :
Bukhari, Syed Saqib ; Shafait, Faisal ; Breuel, Thomas M.
Author_Institution :
Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
Abstract :
Detection of curled textline is important for dewarping of hand-held camera-captured document images. Then baselines and the lines following the top of x-height of characters (x-lines) are estimated for dewarping. Existing curled textline segmentation approaches are sensitive to outlier points and perspective distortions. Furthermore these approaches use regression over top and bottom points of a segmented textline to estimate its x-line and baseline separately, which may results in inaccurate estimation. Here we propose a novel curled textline segmentation approach based on active contours (snakes) in which we perform segmentation by estimating the pairs of x-line and baseline; solving both problems together. Starting form a connected component we jointly trace a pair of x-line and baseline using coupled snakes and external energies of neighboring top-bottom points. We grow neighborhood region iteratively during tracing, which results in robustness to perspective distortions, and maintain a natural property of similar distance within the pair of x-line and baseline pair, which results in robustness to outlier points. We achieved 90.76% of one-to-one match-score recognition accuracy of curled textline segmentation on CBDAR 2007 document image dewarping contest dataset, with good estimation of pairs of x-line and baseline.
Keywords :
distortion; document image processing; edge detection; image matching; image segmentation; object detection; object recognition; regression analysis; text analysis; CBDAR 2007; active contour; baseline estimation; connected component; coupled snakelet model; curled textline detection; curled textline segmentation approach; document image dewarping contest dataset; hand-held camera-captured document image; one-to-one match-score recognition; outlier point; perspective distortion; regression method; x-line estimation; Active contours; Artificial intelligence; Coupled mode analysis; Image analysis; Image recognition; Image segmentation; Level set; Merging; Robustness; Text analysis;
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2009.204