DocumentCode
3582610
Title
Towards a standard Bangla PhotoOCR: Text detection and localization
Author
Islam, Md Zahidul ; Mondal, Amit Kumar
Author_Institution
Comput. Sci. & Eng. Discipline, Khulna Univ., Khulna, Bangladesh
fYear
2014
Firstpage
198
Lastpage
203
Abstract
A complete Bangla PhotoOCR requires a series of carefully chosen algorithms. Text extraction from images is a long-standing, active research area, and it is even more attractive today because low-cost mobile image acquisition devices are widely available. Many researchers have addressed this problem using different approaches. Often, the first step in extracting text from images is detecting the text areas. Bangla text, especially in images, poses a set of challenges distinct from those posed by text in other languages. In this paper, we experiment with two established approaches, available for other languages, to automatically localize Bangla text in complex natural scene images, as a step towards developing a complete Bangla PhotoOCR system. In our approach, features are extracted from an image using wavelet-based decomposition and histogram calculation techniques. We use 56 features to train two different types of classifiers (ANN-based and SVM-based) to localize Bangla text in natural scene images. Our experimental results show that the ANN is a good classifier for identifying Bangla text.
Keywords
feature extraction; image classification; neural nets; optical character recognition; support vector machines; text detection; wavelet transforms; ANN-based classifier; SVM-based classifier; automatic Bangla text localization; complex natural scene images; histogram calculation techniques; mobile image acquisition devices; standard Bangla PhotoOCR; text area detection; text extraction; wavelet-based decomposition; Artificial neural networks; Character recognition; Feature extraction; Image edge detection; Roads; Support vector machines; Text recognition; Bangla Text Extraction; Computer Vision; Pattern Recognition; PhotoOCR
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer and Information Technology (ICCIT), 2014 17th International Conference on
Type
conf
DOI
10.1109/ICCITechn.2014.7073084
Filename
7073084