DocumentCode
3701537
Title
Models for decryption of historical shorthand documents
Author
Aleksandr A. Rogov;Mikhail B. Gippiev;Ivan A. Shterkel
Author_Institution
Petrozavodsk State University, 33 pr. Lenina, Petrozavodsk, Karelia, 185910, Russia
fYear
2015
Firstpage
665
Lastpage
667
Abstract
This article presents methods that are used for historical shorthand documents recognition. We distinguish following tasks: binarization, clusterization, lines recognition and determination of symbols types (main, superscript, subscript). Each method is evaluated in terms of recall, precision and F-measure criteria. The best method for binarization of shorthand documents appeared to be the modified threshold method. We proposed following methods for graphic symbols clustering: the method of segments lengths comparison, the method of projections comparison and the method of baskets. The best result is achieved with the method of baskets. We also present the algorithms of lines recognition and symbols classification. Lines recognition is performed using two methods: nearest neighbour and relations graph construction. Symbols classification is done by single and by double approximation methods and their modification. The best result of lines segmentation is demonstrated by the method of relations graph construction, and the best result of determination of symbols types is demonstrated by the modified double approximation method.
Keywords
"Approximation methods","Approximation algorithms","Image segmentation","Classification algorithms","Clustering methods","Shape"
Publisher
ieee
Conference_Titel
"Stability and Control Processes" in Memory of V.I. Zubov (SCP), 2015 International Conference
Type
conf
DOI
10.1109/SCP.2015.7342239
Filename
7342239
Link To Document