DocumentCode :
3695261
Title :
Scale and rotation invariant OCR for Pashto cursive script using MDLSTM network
Author :
Riaz Ahmad;Muhammad Zeshan Afzal;Sheikh Faisal Rashid;Marcus Liwicki;Thomas Breuel
Author_Institution :
TU-Kaiserslautern, Germany
fYear :
2015
Firstpage :
1101
Lastpage :
1105
Abstract :
Optical Character Recognition (OCR) of cursive scripts like Pashto and Urdu is difficult due the presence of complex ligatures and connected writing styles. In this paper, we evaluate and compare different approaches for the recognition of such complex ligatures. The approaches include Hidden Markov Model (HMM), Long Short Term Memory (LSTM) network and Scale Invariant Feature Transform (SIFT). Current state of the art in cursive script assumes constant scale without any rotation, while real world data contain rotation and scale variations. This research aims to evaluate the performance of sequence classifiers like HMM and LSTM and compare their performance with descriptor based classifier like SIFT. In addition, we also assess the performance of these methods against the scale and rotation variations in cursive script ligatures. Moreover, we introduce a database of 480,000 images containing 1000 unique ligatures or sub-words of Pashto. In this database, each ligature has 40 scale and 12 rotation variations. The evaluation results show a significantly improved performance of LSTM over HMM and traditional feature extraction technique such as SIFT.
Keywords :
"Hidden Markov models","Feeds","Image recognition"
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
Type :
conf
DOI :
10.1109/ICDAR.2015.7333931
Filename :
7333931
Link To Document :
بازگشت