• DocumentCode
    1640668
  • Title

    Coupled Snakelet Model for Curled Textline Segmentation of Camera-Captured Document Images

  • Author

    Bukhari, Syed Saqib ; Shafait, Faisal ; Breuel, Thomas M.

  • Author_Institution
    Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
  • fYear
    2009
  • Firstpage
    61
  • Lastpage
    65
  • Abstract
    Detection of curled textline is important for dewarping of hand-held camera-captured document images. Then baselines and the lines following the top of x-height of characters (x-lines) are estimated for dewarping. Existing curled textline segmentation approaches are sensitive to outlier points and perspective distortions. Furthermore these approaches use regression over top and bottom points of a segmented textline to estimate its x-line and baseline separately, which may results in inaccurate estimation. Here we propose a novel curled textline segmentation approach based on active contours (snakes) in which we perform segmentation by estimating the pairs of x-line and baseline; solving both problems together. Starting form a connected component we jointly trace a pair of x-line and baseline using coupled snakes and external energies of neighboring top-bottom points. We grow neighborhood region iteratively during tracing, which results in robustness to perspective distortions, and maintain a natural property of similar distance within the pair of x-line and baseline pair, which results in robustness to outlier points. We achieved 90.76% of one-to-one match-score recognition accuracy of curled textline segmentation on CBDAR 2007 document image dewarping contest dataset, with good estimation of pairs of x-line and baseline.
  • Keywords
    distortion; document image processing; edge detection; image matching; image segmentation; object detection; object recognition; regression analysis; text analysis; CBDAR 2007; active contour; baseline estimation; connected component; coupled snakelet model; curled textline detection; curled textline segmentation approach; document image dewarping contest dataset; hand-held camera-captured document image; one-to-one match-score recognition; outlier point; perspective distortion; regression method; x-line estimation; Active contours; Artificial intelligence; Coupled mode analysis; Image analysis; Image recognition; Image segmentation; Level set; Merging; Robustness; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.204
  • Filename
    5277789