• DocumentCode
    3488181
  • Title

    A Locale Group Based Line Segmentation Approach for Non Uniform Skewed and Curved Arabic Handwritings

  • Author

    Dinges, Laslo ; Al-Hamadi, Ayoub ; Elzobi, Moftah

  • Author_Institution
    Inst. for Electron. Signal Process. & Commun. (IESK), Otto-von-Guericke-Univ., Magdeburg, Germany
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    803
  • Lastpage
    806
  • Abstract
    In this paper we present a novel local group based method for extracting skewed and curved handwritten text lines in Arabic document images. We first detect all connected components and use a Support Vector Machine (SVM) to classify them either as Piece of Arabic Word (PAW) or diacritic. We then, use novel distance measures like sigmoid function based shapes to calculate the nearest neighbors for all PAWs. A subsequently graph based grouping algorithms, which follows the text lines from right to left, generates multiple candidate lines. After assessing the quality of all line candidates the final line representation is chosen. In a final step all PAWs which are not already part of a final line are inserted into the one that is closest. Experimental results show a successfully line segmentation for documents of different writers and styles.
  • Keywords
    document image processing; graph theory; handwriting recognition; image representation; image segmentation; natural language processing; support vector machines; text analysis; Arabic document images; PAW; Piece of Arabic Word; SVM; connected component classification; curved Arabic handwriting; curved handwritten text line extraction; final line representation; graph based grouping algorithms; locale group based line segmentation approach; nonuniform skewed Arabic handwriting; sigmoid function based shapes; skewed handwritten text line extraction; support vector machine; Accuracy; Current measurement; Density measurement; Image segmentation; Shape; Shape measurement; Support vector machines; Evolutionary Algorithms; Handwritten Arabic Documents; Line Segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.164
  • Filename
    6628729