DocumentCode :
1738356
Title :
An efficient extraction of character string positions using morphological operator
Author :
Park, Chang-Joon ; Moon, Kyung-Ae ; Oh, Weon-Geun ; Choi, Heung-Moon
Author_Institution :
Electron. & Telecommun. Res. Inst., Taejon, South Korea
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1616
Abstract :
An efficient extraction of character string positions in a document is proposed by using a morphological operator. In regions of character strings, axial edge pixels and diagonal edge pixels are mingled together, but in other regions, they are distributed separately. Based on this difference in the directional edge pixel distribution between the character and the non-character regions, string positions are extracted directly from arbitrary blocks without any block analysis, in contrast to previous work which requires block analysis to extract string positions (F.M. Wahl et al., 1982; S. Imade et al., 1993). Experiments are conducted on the document images acquired through the scanner, and the proposed method can directly extract the character string positions from the plain text of character blocks, and even from the document containing tables and flow-charts, without any block analysis
Keywords :
document image processing; feature extraction; image scanners; mathematical morphology; optical character recognition; arbitrary blocks; axial edge pixels; block analysis; character blocks; character regions; character string position extraction; diagonal edge pixels; directional edge pixel distribution; document images; flow-charts; morphological operator; non-character regions; plain text; scanner; Character recognition; Data flow computing; Data mining; Flowcharts; Image analysis; Image converters; Information analysis; Labeling; Moon; Pixel;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 2000 IEEE International Conference on
Conference_Location :
Nashville, TN
ISSN :
1062-922X
Print_ISBN :
0-7803-6583-6
Type :
conf
DOI :
10.1109/ICSMC.2000.886253
Filename :
886253
Link To Document :
بازگشت