• DocumentCode
    3701537
  • Title

    Models for decryption of historical shorthand documents

  • Author

    Aleksandr A. Rogov;Mikhail B. Gippiev;Ivan A. Shterkel

  • Author_Institution
    Petrozavodsk State University, 33 pr. Lenina, Petrozavodsk, Karelia, 185910, Russia
  • fYear
    2015
  • Firstpage
    665
  • Lastpage
    667
  • Abstract
    This article presents methods that are used for historical shorthand documents recognition. We distinguish following tasks: binarization, clusterization, lines recognition and determination of symbols types (main, superscript, subscript). Each method is evaluated in terms of recall, precision and F-measure criteria. The best method for binarization of shorthand documents appeared to be the modified threshold method. We proposed following methods for graphic symbols clustering: the method of segments lengths comparison, the method of projections comparison and the method of baskets. The best result is achieved with the method of baskets. We also present the algorithms of lines recognition and symbols classification. Lines recognition is performed using two methods: nearest neighbour and relations graph construction. Symbols classification is done by single and by double approximation methods and their modification. The best result of lines segmentation is demonstrated by the method of relations graph construction, and the best result of determination of symbols types is demonstrated by the modified double approximation method.
  • Keywords
    "Approximation methods","Approximation algorithms","Image segmentation","Classification algorithms","Clustering methods","Shape"
  • Publisher
    ieee
  • Conference_Titel
    "Stability and Control Processes" in Memory of V.I. Zubov (SCP), 2015 International Conference
  • Type

    conf

  • DOI
    10.1109/SCP.2015.7342239
  • Filename
    7342239