• DocumentCode
    1639455
  • Title

    A Steerable Directional Local Profile Technique for Extraction of Handwritten Arabic Text Lines

  • Author

    Shi, Zhixin ; Setlur, Srirangaraj ; Govindaraju, Venu

  • Author_Institution
    Dept. of Comput. Sci. & Eng., State Univ. of New York at Buffalo, Buffalo, NY, USA
  • fYear
    2009
  • Firstpage
    176
  • Lastpage
    180
  • Abstract
    In this paper, we present a new text line extraction method for handwritten Arabic documents. The proposed technique is based on a generalized adaptive local connectivity map (ALCM) using a steerable directional filter. The algorithm is designed to solve the particularly complex problems seen in handwritten documents such as fluctuating, touching or crossing text lines. The proposed algorithm consists of three steps. Firstly, a steerable filter is used to probe and determine foreground intensity along multiple directions at each pixel while generating the ALCM. The ALCM is then binarized using an adaptive thresholding algorithm to get a rough estimate of the location of the text lines. In the second step, connected component analysis is used to classify text and non text patterns in the generated ALCM to refine the location of the text lines. Finally, the text lines are separated by superimposing the text line patterns in the ALCM on the original document image and extracting the connected components covered by the pattern mask. Analysis of experimental results on the DARPA MADCAT Arabic handwritten document data indicate that the method is robust and is capable of correctly isolating handwritten text lines even on challenging document images.
  • Keywords
    document image processing; handwriting recognition; image classification; image segmentation; statistical analysis; adaptive thresholding algorithm; connected component analysis; document image; generalized adaptive local connectivity map; handwritten Arabic document; handwritten Arabic text line extraction; steerable directional filter; steerable directional local profile technique; text classification; Adaptive filters; Biometrics; Biosensors; Handwriting recognition; Iterative algorithms; Partitioning algorithms; Strips; Text analysis; Text recognition; Venus; Arabic document; OCR; andwritten text line extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.79
  • Filename
    5277744