• DocumentCode
    1738356
  • Title

    An efficient extraction of character string positions using morphological operator

  • Author

    Park, Chang-Joon ; Moon, Kyung-Ae ; Oh, Weon-Geun ; Choi, Heung-Moon

  • Author_Institution
    Electron. & Telecommun. Res. Inst., Taejon, South Korea
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1616
  • Abstract
    An efficient extraction of character string positions in a document is proposed by using a morphological operator. In regions of character strings, axial edge pixels and diagonal edge pixels are mingled together, but in other regions, they are distributed separately. Based on this difference in the directional edge pixel distribution between the character and the non-character regions, string positions are extracted directly from arbitrary blocks without any block analysis, in contrast to previous work which requires block analysis to extract string positions (F.M. Wahl et al., 1982; S. Imade et al., 1993). Experiments are conducted on the document images acquired through the scanner, and the proposed method can directly extract the character string positions from the plain text of character blocks, and even from the document containing tables and flow-charts, without any block analysis
  • Keywords
    document image processing; feature extraction; image scanners; mathematical morphology; optical character recognition; arbitrary blocks; axial edge pixels; block analysis; character blocks; character regions; character string position extraction; diagonal edge pixels; directional edge pixel distribution; document images; flow-charts; morphological operator; non-character regions; plain text; scanner; Character recognition; Data flow computing; Data mining; Flowcharts; Image analysis; Image converters; Information analysis; Labeling; Moon; Pixel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2000 IEEE International Conference on
  • Conference_Location
    Nashville, TN
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-6583-6
  • Type

    conf

  • DOI
    10.1109/ICSMC.2000.886253
  • Filename
    886253