• DocumentCode
    1639423
  • Title

    Devanagari and Bangla Text Extraction from Natural Scene Images

  • Author

    Bhattacharya, U. ; Parui, S.K. ; Mondal, S.

  • Author_Institution
    Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Kolkata, India
  • fYear
    2009
  • Firstpage
    171
  • Lastpage
    175
  • Abstract
    With the increasing popularity of digital cameras attached with various handheld devices, many new computational challenges have gained significance. One such problem is extraction of texts from natural scene images captured by such devices. The extracted text can be sent to OCR or to a text-to-speech engine for recognition. In this article, we propose a novel and effective scheme based on analysis of connected components for extraction of Devanagari and Bangla texts from camera captured scene images. A common unique feature of these two scripts is the presence of headline and the proposed scheme uses mathematical morphology operations for their extraction. Additionally, we consider a few criteria for robust filtering of text components from such scene images. Moreover, we studied the problem of binarization of such scene images and observed that there are situations when repeated binarization by a well-known global thresholding approach is effective. We tested our algorithm on a repository of 100 scene images containing texts of Devanagari and / or Bangla.
  • Keywords
    feature extraction; filtering theory; image segmentation; mathematical morphology; optical character recognition; text analysis; digital camera; image thresholding; mathematical morphology; natural scene image; optical character recognition; robust filtering; text extraction; text-to-speech engine; Digital cameras; Engines; Handheld computers; Image analysis; Layout; Morphology; Optical character recognition software; Robustness; Speech synthesis; Text recognition; Camera-based document recognition; Text extraction from scene images;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.178
  • Filename
    5277743