DocumentCode :
2022166
Title :
A Case-Based Reasoning Approach for Invoice Structure Extraction
Author :
Hamza, Hatem ; Belaid, Yolande ; Belaid, Abdel
Author_Institution :
ITESOFT, Aimargues
Volume :
1
fYear :
2007
fDate :
23-26 Sept. 2007
Firstpage :
327
Lastpage :
331
Abstract :
This paper shows the use of case-based reasoning (CBR) for invoice structure extraction and analysis. This method, called CBR-DIA (CBR for document invoice analysis), is adaptive and does not need any previous training. It analyses a document by retrieving and analysing similar documents or elements of documents (cases) stored in a database. The retrieval step is performed thanks to graph comparison techniques like graph probing and edit distance. The analysis step is done thanks to the information found in the nearest retrieved cases. Applied on 950 invoices, CBR-DIA reaches a recognition rate of 85.29% for documents of known classes and 76.33% for documents of unknown classes.
Keywords :
case-based reasoning; document image processing; feature extraction; graph theory; image retrieval; optical character recognition; OCR; case-based reasoning approach; document database; document invoice analysis; document retrieval; edit distance; graph comparison technique; graph probing; invoice structure extraction; Artificial intelligence; Data mining; Databases; Image analysis; Information analysis; Information retrieval; Problem-solving; Tagging; Terminology; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
ISSN :
1520-5363
Print_ISBN :
978-0-7695-2822-9
Type :
conf
DOI :
10.1109/ICDAR.2007.4378726
Filename :
4378726
Link To Document :
بازگشت