Title : 
Models for decryption of historical shorthand documents
         
        
            Author : 
Aleksandr A. Rogov;Mikhail B. Gippiev;Ivan A. Shterkel
         
        
            Author_Institution : 
Petrozavodsk State University, 33 pr. Lenina, Petrozavodsk, Karelia, 185910, Russia
         
        
        
        
        
            Abstract : 
This article presents methods that are used for historical shorthand documents recognition. We distinguish following tasks: binarization, clusterization, lines recognition and determination of symbols types (main, superscript, subscript). Each method is evaluated in terms of recall, precision and F-measure criteria. The best method for binarization of shorthand documents appeared to be the modified threshold method. We proposed following methods for graphic symbols clustering: the method of segments lengths comparison, the method of projections comparison and the method of baskets. The best result is achieved with the method of baskets. We also present the algorithms of lines recognition and symbols classification. Lines recognition is performed using two methods: nearest neighbour and relations graph construction. Symbols classification is done by single and by double approximation methods and their modification. The best result of lines segmentation is demonstrated by the method of relations graph construction, and the best result of determination of symbols types is demonstrated by the modified double approximation method.
         
        
            Keywords : 
"Approximation methods","Approximation algorithms","Image segmentation","Classification algorithms","Clustering methods","Shape"
         
        
        
            Conference_Titel : 
"Stability and Control Processes" in Memory of V.I. Zubov (SCP), 2015 International Conference
         
        
        
            DOI : 
10.1109/SCP.2015.7342239