• DocumentCode
    3058316
  • Title

    A character segmentation method for Japanese printed documents coping with touching character problems

  • Author

    Ariyoshi, Shunji

  • Author_Institution
    Res. & Dev. Center, Toshiba Corp., Kanagawa, Japan
  • fYear
    1992
  • fDate
    30 Aug-3 Sep 1992
  • Firstpage
    313
  • Lastpage
    316
  • Abstract
    Proposes a character segmentation method for Japanese printed documents. Since character segmentation is a kind of a search problem, avoiding `combinatorial explosion´ is essential in realizing practical systems. Segmentation is very complicated especially when characters touch each other. The method described gives a multi-stage algorithm, where the earlier stages treat more reliable segmentation than the later stages which utilize information obtained from the results of earlier stages. Segmentation hypotheses are generated in each stage on the basis of the results of earlier stages, and they are verified by the character recognition results. Experiments on more than one hundred documents have proven that this method is efficient and accurate for practical applications
  • Keywords
    character recognition; document image processing; image segmentation; Japanese printed documents; character recognition; character segmentation; multistage algorithm; touching character problems; Books; Character recognition; Dynamic programming; Optical character recognition software; Research and development; Search problems; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
  • Conference_Location
    The Hague
  • Print_ISBN
    0-8186-2915-0
  • Type

    conf

  • DOI
    10.1109/ICPR.1992.201780
  • Filename
    201780