DocumentCode :
3171672
Title :
Use of character recognition and syntax in locating address paragraphs in complex documents
Author :
Lii, Jenchyou ; Srihari, Sargur N.
Author_Institution :
Centre of Excellence for Document Analysis & Recognition, State Univ. of New York, Buffalo, NY, USA
Volume :
2
fYear :
1994
fDate :
9-13 Oct 1994
Firstpage :
374
Abstract :
An address paragraph consists of several lines (sentences) of text. Each sentence consists of numbers, words and phrases that identify a meaningful geographical region. Addresses are typically present in documents such as postal envelopes, business cards, and facsimile cover pages, where other text and graphics also appear. The problem considered is that of segmenting the subimage corresponding to the address paragraph from the entire document image. Previous approaches to locating address paragraphs, primarily for postal envelopes, have avoided use of character and word information and focused on global features. The authors present a new method based on using character recognition and address syntax. In this approach, the core problem is formulated as a consistent labeling problem based on syntactic features. The proposed method has been implemented for locating the destination address block on machine-printed letter mail. Experimental results with 2000 images of letter mail are reported.
Keywords :
document image processing; address paragraphs; address syntax; business cards; character recognition; complex documents; destination address block; facsimile cover pages; geographical region; labeling problem; machine-printed letter mail; postal envelopes; syntactic features; Character recognition; Databases; Facsimile; Graphics; Image segmentation; Labeling; Postal services; Sorting; Text analysis; Text recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Pattern Recognition, 1994. Vol. 2 - Conference B: Computer Vision & Image Processing, Proceedings of the 12th IAPR International Conference on
Conference_Location :
Jerusalem
Print_ISBN :
0-8186-6270-0
Type :
conf
DOI :
10.1109/ICPR.1994.576943
Filename :
576943