• DocumentCode
    134569
  • Title

    Skeleton extraction: Comparison of five methods on the Arabic IFN/ENIT database

  • Author

    Al-Shatnawi, Atallah M. ; Omar, K. ; AlFawwaz, Bader M. ; Zeki, Akram M.

  • Author_Institution
    Dept. of Inf. Syst., Al-albayt Univ., Mafraq, Jordan
  • fYear
    2014
  • fDate
    26-27 March 2014
  • Firstpage
    50
  • Lastpage
    59
  • Abstract
    Thinning “Skeletonization” is a very crucial stage in the Arabic Character Recognition (ACR) system. It simplifies the text shape and reduces the amount of data that needs to be handled and it is usually used as a pre-processing stage for recognition and storage systems. The skeleton of Arabic text can be used for: baseline detection, character segmentation, and features extraction, and ultimately supporting the classification. In this paper, five of the state of the art thinning algorithms are selected and implemented. The five algorithms are: SPTA, Zhang-Suen parallel thinning algorithm, Huang-Wan-Liu thinning algorithm, thinning and skeletonization based morphological operation algorithms. The five selected algorithms are applied on the IFN/ENIT dataset. The results obtained by the five methods are discussed and analyzed against the IFN/ENIT dataset based on preserving shape and the text connectivity, preventing spurious tails, maintaining one pixel width skeleton and avoiding the necking problem as well as running time efficiently. In addition to that some performance measurement for checking text connectivity, spurious tails and calculating the stroke thickness are proposed and carried out.
  • Keywords
    character recognition; feature extraction; image classification; image segmentation; image thinning; ACR system; Arabic IFN/ENIT database; Arabic character recognition system; Arabic text skeleton; Huang-Wan-Liu thinning algorithm; IFN/ENIT dataset; SPTA; Zhang-Suen parallel thinning algorithm; baseline detection; character segmentation; classification; data handling; features extraction; performance measurement; shape preservation; skeleton extraction; skeletonization based morphological operation algorithm; spurious tail prevention; storage system; text connectivity preservation; text shape; thinning based morphological operation algorithm; thinning skeletonization; Algorithm design and analysis; Classification algorithms; Educational institutions; Image edge detection; Information systems; Shape; Skeleton; Arabic Character Recognition; Huang-Wan-L; SPTA; Skeleton; Text connectivity; Thinning; Zhang-Suen; morphological;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology (CSIT), 2014 6th International Conference on
  • Conference_Location
    Amman
  • Print_ISBN
    978-1-4799-3998-5
  • Type

    conf

  • DOI
    10.1109/CSIT.2014.6805978
  • Filename
    6805978