DocumentCode
3023272
Title
Intelligent document processing
Author
Esposito, Floriana ; Ferilli, Stefano ; Basile, Teresa M A ; Mauro, Nicola Di
Author_Institution
Dept. of Comput. Sci., Bari Univ., Italy
fYear
2005
fDate
29 Aug.-1 Sept. 2005
Firstpage
1100
Abstract
Digital repositories raise the need for an effective and efficient retrieval of the stored material. In this paper, we propose the intensive application of intelligent techniques to the steps of document layout analysis, document image classification and understanding on digital documents. Specifically, the complex interrelation existing among layout components, that are fundamental to assign them the proper semantic role, suggest the exploitation of first-order representations in some learning steps. Results obtained in a prototypical system for scientific conference management prove that the proposed approach can be beneficial both for the layout recognition and for the selection of interesting components of the document, from which extracting the text for categorizing the document according to its topic.
Keywords
document image processing; image classification; information retrieval; digital documents; digital repositories; document image classification; document layout analysis; document layout recognition; first-order representations; intelligent document processing; scientific conference management; stored material retrieval; Application software; Computer science; Conference management; Image analysis; Image classification; Iterative algorithms; Machine learning; Page description languages; Prototypes; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN
1520-5263
Print_ISBN
0-7695-2420-6
Type
conf
DOI
10.1109/ICDAR.2005.144
Filename
1575714
Link To Document