• DocumentCode
    604635
  • Title

    “i” — A novel algorithm for optical character recognition (OCR)

  • Author

    Shastry, S. ; Gunasheela, G. ; Dutt, T. ; Vinay, D.S. ; Rupanagudi, Sudhir Rao

  • Author_Institution
    WorldServe Educ., Bangalore, India
  • fYear
    2013
  • fDate
    22-23 March 2013
  • Firstpage
    389
  • Lastpage
    393
  • Abstract
    Computer vision, artificial intelligence and pattern recognition have been important areas of research for a while in the history of electronics and image processing. Optical character recognition (OCR) is one of the main aspects of computer vision and has evolved greatly since its inception. OCR is a method in which readable characters are recognized from optical data obtained digitally. Many methodologies and algorithms have been developed for this purpose using different approaches. Here we present one such approach for OCR named “ i ”. Amongst all other OCR systems available, “ i ” aims at a high speed, simple, font independent and size independent OCR system based on a unique segment extraction technique. This algorithm can be used as a kernel for single alphabet detection within a complete OCR solution system without the need for any complex mathematical operations. The highlight of this methodology is that, it does not use any libraries or databases of image matrices to recognize alphabets, but it has a unique algorithm to recognize alphabets instead. This algorithm has been implemented in MATLAB 7.14.0.739 build R2012a on a test set of 500 images of text and an accuracy of 100% for three font families namely Arial, Times New Roman and cchas been obtained.
  • Keywords
    computer vision; feature extraction; mathematics computing; object detection; optical character recognition; Arial; Courier New; MATLAB 7.14.0.739; R2012a; Times New Romanv; artificial intelligence; computer vision; font independent OCR system; i algorithm; optical character recognition; pattern recognition; segment extraction technique; single alphabet detection; size independent OCR system; Character recognition; Feature extraction; Image edge detection; Image segmentation; Optical character recognition software; Training; Vectors; Feature extraction; OCR; image processing based OCR; segment extraction; segment profiling; segment storage;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automation, Computing, Communication, Control and Compressed Sensing (iMac4s), 2013 International Multi-Conference on
  • Conference_Location
    Kottayam
  • Print_ISBN
    978-1-4673-5089-1
  • Type

    conf

  • DOI
    10.1109/iMac4s.2013.6526442
  • Filename
    6526442