DocumentCode
594766
Title
Multipage document retrieval by textual and visual representations
Author
Rusinol, Marcal ; Karatzas, Dimosthenis ; Bagdanov, Andrew D. ; Llados, Josep
Author_Institution
Dept. Cienc. de la Computacio, Univ. Autonoma de Barcelona, Bellaterra, Spain
fYear
2012
fDate
11-15 Nov. 2012
Firstpage
521
Lastpage
524
Abstract
In this paper we present a multipage administrative document image retrieval system based on textual and visual representations of document pages. Individual pages are represented by textual or visual information using a bag-of-words framework. Different fusion strategies are evaluated which allow the system to perform multipage document retrieval on the basis of a single page retrieval system. Results are reported on a large dataset of document images sampled from a banking workflow.
Keywords
banking; document image processing; information retrieval; bag-of-words framework; banking workflow; document images; document pages; fusion strategies; multipage administrative document image retrieval system; single page retrieval system; textual representations; visual representations; Banking; Histograms; Image retrieval; Semantics; Vectors; Visualization; Vocabulary;
fLanguage
English
Publisher
IEEE
Conference_Titel
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location
Tsukuba
ISSN
1051-4651
Print_ISBN
978-1-4673-2216-4
Type
conf
Filename
6460186