DocumentCode :
3510534
Title :
Bengali Optical Character Recognition using self organizing map
Author :
Kibria, Muhammad Golam ; Al-Imtiaz
Author_Institution :
Dept. of CSE, Univ. of Inf. Technol. & Sci., Dhaka, Bangladesh
fYear :
2012
fDate :
18-19 May 2012
Firstpage :
764
Lastpage :
769
Abstract :
Ranked fifth in the world and declared the sweetest language by UNESCO, Bengali is the national language of Bangladesh and one of the major languages of India. A lot of research has been done on recognizing Bengali, English, and other major languages using Optical Character Recognition (OCR). To recognize Bengali characters in text images and convert them into editable text, a Self-Organizing Map (SOM), a kind of neural network, has been used. To collect the characters, documents are scanned and then preprocessed with the Image to Binary Conversion Algorithm. In the binary image, the character area is represented by 0 (zero) and the rest of the image area by 1 (one). After skew and noise are detected and corrected, the binary image is processed and grouped so that it can be mapped and recognized by the SOM. A character grouping process has been introduced to improve efficiency and speed.
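The binarization convention and the SOM mapping described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give the details of the Image to Binary Conversion Algorithm, so the fixed threshold of 128, the names `binarize` and `TinySOM`, the grid size, and the linearly decaying learning rate and neighbourhood width are all assumptions made for illustration. Skew correction, noise removal, and character grouping are omitted.

```python
import numpy as np

def binarize(gray, threshold=128):
    """Map dark (character) pixels to 0 and light (background) pixels to 1.

    The fixed threshold is an assumption; the abstract does not specify one.
    """
    return (np.asarray(gray) >= threshold).astype(np.uint8)

class TinySOM:
    """A small self-organizing map on a rows x cols grid (illustrative only)."""

    def __init__(self, rows, cols, dim, seed=0):
        rng = np.random.default_rng(seed)
        # One weight vector of length `dim` per grid node, initialised in [0, 1).
        self.weights = rng.random((rows, cols, dim))
        # Grid coordinates of every node, used for neighbourhood distances.
        self.grid = np.stack(
            np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij"), axis=-1
        )

    def winner(self, x):
        """Return the (row, col) of the node whose weights are closest to x."""
        d = np.linalg.norm(self.weights - x, axis=-1)
        return np.unravel_index(np.argmin(d), d.shape)

    def train(self, data, epochs=50, lr0=0.5, sigma0=1.0):
        for t in range(epochs):
            frac = 1.0 - t / epochs            # linear decay (assumed schedule)
            lr = lr0 * frac
            sigma = max(sigma0 * frac, 1e-3)
            for x in data:
                bmu = np.array(self.winner(x))
                dist2 = np.sum((self.grid - bmu) ** 2, axis=-1)
                h = np.exp(-dist2 / (2 * sigma ** 2))   # Gaussian neighbourhood
                # Pull each node toward x, weighted by its closeness to the winner.
                self.weights += lr * h[..., None] * (x - self.weights)

# Usage: binarize a toy 2x2 "glyph", flatten it, and map it onto the SOM.
glyph = binarize([[0, 255], [200, 10]]).reshape(-1).astype(float)
som = TinySOM(rows=2, cols=2, dim=glyph.size)
som.train([glyph])
print(som.winner(glyph))
```

In a recognizer built along these lines, each trained node would be labelled with the character it responds to, and a new glyph would be assigned the label of its winning node.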
Keywords :
document image processing; natural language processing; optical character recognition; self-organising feature maps; text detection; Bangladesh; Bengali optical character recognition; English; India; OCR; SOM; UNESCO; character grouping process; document scanning; image-binary conversion algorithm; neural network; selforganizing map; text images; Adaptive optics; Image segmentation; Integrated circuits; Integrated optics; Optical character recognition software; Optical imaging; Radio frequency;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Informatics, Electronics & Vision (ICIEV), 2012 International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4673-1153-3
Type :
conf
DOI :
10.1109/ICIEV.2012.6317479
Filename :
6317479