Title :
Urdu optical character recognition technique using point feature matching; a generic approach
Author :
Wafa Qaiser Khan;Reema Qaiser Khan
Author_Institution :
Computer & Software Engineering Department, Bahria University Karachi Campus, Karachi, Pakistan
Abstract :
The complexity associated with Urdu fonts regarding OCR in newspapers is being dealt with active research. When creating an Urdu OCR you are limited to a certain font size i.e. if working with a font size of 12, you will have to create a database covering all characters/words of font size 12. In order to work with another font size of same Urdu font you´ll have to cover all the characters/words of that respective font size. The OCR technique should be generic where the font size should not matter. The objective was to create a technique that could be applied to any Urdu script font size, without worrying about the variation of characters/words caused by the disposal of ink in Urdu newspaper clippings. In this paper the authors have developed a technique using point feature matching on cropped Urdu newspaper clippings with font Jameel Noori Nastaleeq and converted them into editable textual Unicodes.
Keywords :
"Feature extraction","Optical character recognition software","Databases","Robustness","Computers","Optical filters","Optical imaging"
Conference_Titel :
Information and Communication Technologies (ICICT), 2015 International Conference on
DOI :
10.1109/ICICT.2015.7469576