• DocumentCode
    3773803
  • Title
    Urdu optical character recognition technique using point feature matching; a generic approach
  • Author
    Wafa Qaiser Khan;Reema Qaiser Khan
  • Author_Institution
    Computer & Software Engineering Department, Bahria University Karachi Campus, Karachi, Pakistan
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    The complexity of Urdu fonts makes OCR of newspaper text a subject of active research. An Urdu OCR system is typically tied to a particular font size: to work with a font size of 12, for example, a database covering all characters/words at size 12 must be created, and working with another size of the same Urdu font requires covering all characters/words at that size as well. An OCR technique should instead be generic, so that the font size does not matter. The objective was to create a technique that could be applied to any Urdu script font size, without worrying about the variation in characters/words caused by the disposal of ink in Urdu newspaper clippings. In this paper, the authors develop a technique that applies point feature matching to cropped Urdu newspaper clippings set in the Jameel Noori Nastaleeq font and converts them into editable Unicode text.
  • Keywords
    "Feature extraction","Optical character recognition software","Databases","Robustness","Computers","Optical filters","Optical imaging"
  • Publisher
    IEEE
  • Conference_Titel
    Information and Communication Technologies (ICICT), 2015 International Conference on
  • Type
    conf
  • DOI
    10.1109/ICICT.2015.7469576
  • Filename
    7469576