Title :
Recognition of historical Greek polytonic scripts using LSTM networks
Author :
Fotini Simistira;Adnan Ul-Hassan;Vassilis Papavassiliou;Basilis Gatos;Vassilis Katsouros;Marcus Liwicki
Author_Institution :
Institute for Language and Speech Processing, Athena Research and Innovation Center, Athens, Greece
Abstract :
This paper reports on high-performance Optical Character Recognition (OCR) experiments using Long Short-Term Memory (LSTM) Networks for Greek polytonic script. Even though there are many Greek polytonic manuscripts, the digitization of such documents has not been widely applied, and very limited work has been done on the recognition of such scripts. We have collected a large number of diverse document pages of Greek polytonic scripts in a novel database, called Polyton-DB, containing 15; 689 textlines of synthetic and authentic printed scripts and performed baseline experiments using LSTM Networks. Evaluation results show that the character error rate obtained with LSTM varies from 5.51% to 14.68% (depending on the document) and is better than two well-known OCR engines, namely, Tesseract and ABBYY FineReader.
Keywords :
"Logic gates","Optical character recognition software","Adaptive optics","Integrated optics","Optical imaging","Optical fiber networks","Yttrium"
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
DOI :
10.1109/ICDAR.2015.7333865