DocumentCode
3695261
Title
Scale and rotation invariant OCR for Pashto cursive script using MDLSTM network
Author
Riaz Ahmad;Muhammad Zeshan Afzal;Sheikh Faisal Rashid;Marcus Liwicki;Thomas Breuel
Author_Institution
TU-Kaiserslautern, Germany
fYear
2015
Firstpage
1101
Lastpage
1105
Abstract
Optical Character Recognition (OCR) of cursive scripts like Pashto and Urdu is difficult due the presence of complex ligatures and connected writing styles. In this paper, we evaluate and compare different approaches for the recognition of such complex ligatures. The approaches include Hidden Markov Model (HMM), Long Short Term Memory (LSTM) network and Scale Invariant Feature Transform (SIFT). Current state of the art in cursive script assumes constant scale without any rotation, while real world data contain rotation and scale variations. This research aims to evaluate the performance of sequence classifiers like HMM and LSTM and compare their performance with descriptor based classifier like SIFT. In addition, we also assess the performance of these methods against the scale and rotation variations in cursive script ligatures. Moreover, we introduce a database of 480,000 images containing 1000 unique ligatures or sub-words of Pashto. In this database, each ligature has 40 scale and 12 rotation variations. The evaluation results show a significantly improved performance of LSTM over HMM and traditional feature extraction technique such as SIFT.
Keywords
"Hidden Markov models","Feeds","Image recognition"
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
Type
conf
DOI
10.1109/ICDAR.2015.7333931
Filename
7333931
Link To Document