• DocumentCode
    3582610
  • Title

    Towards a standard Bangla PhotoOCR: Text detection and localization

  • Author

    Islam, Md Zahidul ; Mondal, Amit Kumar

  • Author_Institution
    Comput. Sci. & Eng.Discipline, Khulna Univ., Khulna, Bangladesh
  • fYear
    2014
  • Firstpage
    198
  • Lastpage
    203
  • Abstract
    A complete Bangla PhotoOCR requires a series of carefully chosen algorithms. Text extraction from images is a long-standing active research area. It is even more attractive today due to the availability of low-cost mobile image acquisition devices. Many researchers have addressed this problem using different approaches. Often times, the first step towards text extraction from images is detection of text areas. Bangla texts, specially in images, pose a unique set of challenges than texts in other languages. In this paper, we experiment with two established approaches, available for other languages, to automatically localize Bangla texts in complex natural scene images towards developing a complete Bangla PhotoOCR system. In our approach, features are extracted from an image using wavelets based decomposition and histogram calculation techniques. We use 56 features to train two different types of classifiers (ANN based and SVM based) to localize Bangla texts in natural scene images. Our experimental results show that ANN is a good classifier for identifying Bangla texts.
  • Keywords
    feature extraction; image classification; neural nets; optical character recognition; support vector machines; text detection; wavelet transforms; ANN-based classifier; SVM-based classifier; automatic Bangla text localization; complex natural scene images; feature extraction; histogram calculation techniques; mobile image acquisition devices; standard Bangla PhotoOCR; text area detection; text extraction; wavelet-based decomposition; Artificial neural networks; Character recognition; Feature extraction; Image edge detection; Roads; Support vector machines; Text recognition; Bangla Text Extraction; Computer Vision; Pattern Recognition; PhotoOCR;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology (ICCIT), 2014 17th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICCITechn.2014.7073084
  • Filename
    7073084