• DocumentCode
    2631364
  • Title

    Two-stage box connectivity algorithm for optical character recognition

  • Author

    Krtolica, Radovan ; Malitsky, Sophia

  • Author_Institution
    Canon Res. Center America, Inc., Palo Alto, CA, USA
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    179
  • Lastpage
    182
  • Abstract
    The box connectivity approach (BCA) uses a regular grid to partition the character bitmap into n × n boxes. The bitmap is then represented by a graph whose vertices and edges correspond to the boxes and their connectivity. The adjacency matrices of the graphs are represented by two binary matrices of lower size: one for the vertical and one for the horizontal connections. A third binary matrix is used to represent pixel densities in each of the boxes. Hamming distances are used for multicriterion classification of the corresponding binary vectors: only noninferior (Pareto optimal) vectors are retained. Size of the character image and first order Markov chain model of the adjacent characters are used to disambiguate noninferior characters in the second stage of the algorithm
  • Keywords
    Markov processes; graph theory; matrix algebra; optical character recognition; probability; 2-stage box connectivity algorithm; BCA; Hamming distances; Pareto optimal; adjacency matrices; binary matrices; binary vectors; box connectivity approach; character bitmap; character image; first order Markov chain model; graph; multicriterion classification; noninferior characters; optical character recognition; pixel densities; regular grid; Character recognition; Computational efficiency; Degradation; Multi-stage noise shaping; Nearest neighbor searches; Noise robustness; Optical character recognition software; Optical noise; Partitioning algorithms; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395754
  • Filename
    395754