Title :
Automatic invoice interpretation: invoice structure analysis
Author :
Kosiba, David A. ; Kasturi, Rangachar
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Abstract :
We propose a method of invoice document structure analysis that provides a means of extracting the relevant information from an unknown invoice. Our method combines textual and graphical processing: it analyzes the line and line-intersection features in the document and searches for possible keywords such as item number, quantity, and total. Valid keyword search regions are determined by a specialized connected-component analysis before any OCR is performed. The results of the keyword search and the line analysis are combined to give the search regions for extracting the relevant data contained in the invoice. This analysis will become part of a larger invoice interpretation system that is currently under development.
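The abstract does not say how its "specialized" connected-component analysis works, so the following is only a generic sketch of the underlying idea, not the authors' method. It labels 4-connected foreground components in a binary image and keeps the ones small enough to be characters, which discards long ruling lines before any OCR would run. The function names, the toy `page` image, and the size thresholds are all hypothetical.

```python
from collections import deque


def component_boxes(image):
    """Return (top, left, bottom, right) boxes of 4-connected foreground components."""
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and image[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def candidate_text_regions(image, max_height=3, max_width=3):
    """Keep character-sized components; the thresholds are assumed, not from the paper."""
    return [
        (top, left, bottom, right)
        for top, left, bottom, right in component_boxes(image)
        if bottom - top + 1 <= max_height and right - left + 1 <= max_width
    ]


# Toy binary image: row 0 is a horizontal ruling line, the rest are two small blobs.
page = [
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 1, 0, 0],
    [0, 1, 0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]
regions = candidate_text_regions(page)
```

In this toy image the ruling line spans the full width and is filtered out, leaving only the two small components as candidate keyword search regions. A real system would tune the thresholds to the scan resolution and merge neighbouring components into words.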
Keywords :
business forms; document image processing; edge detection; image segmentation; invoicing; automatic invoice interpretation; connected-component analysis; graphical processing; invoice document structure analysis; item number; keyword search regions; line intersection features; textual processing; Computer science; Data mining; Graphics; Keyword search; Marine vehicles; Optical character recognition software; Topology;
Conference_Title :
Proceedings of the 13th International Conference on Pattern Recognition, 1996
Conference_Location :
Vienna
Print_ISBN :
0-8186-7282-X
DOI :
10.1109/ICPR.1996.547263