• DocumentCode
    3379569
  • Title

    A medial axis based thinning strategy and structural feature extraction of character images

  • Author

    Bag, Soumen ; Harit, Gaurav

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Kharagpur, Kharagpur, India
  • fYear
    2010
  • fDate
    26-29 Sept. 2010
  • Firstpage
    2173
  • Lastpage
    2176
  • Abstract
    The thinning methodology is novel in terms of its ability to incorporate character shape specific knowledge while constructing the thinned skeleton. But removal of spurious strokes or shape deformation in thinning is a difficult problem. In this paper, we have proposed a novel medial-axis based thinning strategy used for performing skeletonization of noisy character images. The proposed algorithm produces segmented strokes in vector form as a by-product. Hence further stroke segmentation is not required. Experiment is done on printed English, Bengali, Hindi, and Tamil characters and we obtain less spurious branches compared to other thinning methods without any post processing. We have concluded with a proposed methodology to extract structural features from thinned character images. This feature set improves the performance of existing OCR for Indian languages.
  • Keywords
    character recognition; feature extraction; image segmentation; image thinning; medical image processing; character images; medial axis based thinning strategy; shape deformation; skeletonization; stroke segmentation; structural feature extraction; Extrapolation; Feature extraction; Image segmentation; Junctions; Pixel; Shape; Skeleton; Medial-axis; concavity and convexity; skeleton segment; structural feature; thinning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2010 17th IEEE International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4244-7992-4
  • Electronic_ISBN
    1522-4880
  • Type

    conf

  • DOI
    10.1109/ICIP.2010.5654311
  • Filename
    5654311