• DocumentCode
    327945
  • Title

    Interfered-character recognition by removing interfering-lines and adjusting feature weights

  • Author

    Tseng, Yi-Hong ; Lee, Hsi-Jian

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • Volume
    2
  • fYear
    1998
  • fDate
    16-20 Aug 1998
  • Firstpage
    1865
  • Abstract
    Characters sometimes overlap with nontextual lines in form documents and these interfered-character would generally be recognized with poor accuracy. In this paper we propose a two-step interfering-line removing method. Positions and orientations of interfering-lines are first detected by the Hough transform. Interfering-lines widths are then determined from projection histograms. An ambiguous area is defined to bound an interfering-line. Black runs in the ambiguous are classified into four types and grouped into run-groups. The directions of hidden character strokes in each run-groups are predicted. Black pixels located in these hidden strokes are regarded as character pixels and the other black pixels are considered as interfering pixels, which will be removed then. Most OCR engines are trained by noninterfered sample characters. In order to recognize interfered-characters, we adjust the feature values by assigning a stability value to each of subregions. In our collected 1820 interfered handwritten Chinese characters, the recognition accuracy was 24.02% for interfered-characters and 89.91% for characters after removing interfering-lines
  • Keywords
    Hough transforms; document image processing; optical character recognition; Hough transform; OCR engines; black runs; feature weight adjustment; form documents; handwritten Chinese characters; hidden character strokes; interfered-character recognition; interfering-line removal; noninterfered sample characters; nontextual lines; Character recognition; Computer science; Engines; Handwriting recognition; Histograms; Image restoration; Interference; Optical character recognition software; Pixel; Stability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
  • Conference_Location
    Brisbane, Qld.
  • ISSN
    1051-4651
  • Print_ISBN
    0-8186-8512-3
  • Type

    conf

  • DOI
    10.1109/ICPR.1998.712095
  • Filename
    712095