مرکز منطقه ای اطلاع رساني علوم و فناوري - Integrating Visual and Textual Cues for Query-by-String Word Spotting

DocumentCode :

3487122

Title :

Integrating Visual and Textual Cues for Query-by-String Word Spotting

Author :

Aldavert, David ; Rusinol, Marcal ; Toledo, Rafael ; Llados, Josep

Author_Institution :

Dept. Cienc. de la Computacio, Univ. Autonoma de Barcelona, Bellaterra, Spain

fYear :

2013

fDate :

25-28 Aug. 2013

Firstpage :

511

Lastpage :

515

Abstract :

In this paper, we present a word spotting framework that follows the query-by-string paradigm where word images are represented both by textual and visual representations. The textual representation is formulated in terms of character n-grams while the visual one is based on the bag-of-visual-words scheme. These two representations are merged together and projected to a sub-vector space. This transform allows to, given a textual query, retrieve word instances that were only represented by the visual modality. Moreover, this statistical representation can be used together with state-of-the-art indexation structures in order to deal with large-scale scenarios. The proposed method is evaluated using a collection of historical documents outperforming state-of-the-art performances.

Keywords :

document image processing; image representation; information retrieval; statistical analysis; bag-of-visual-words scheme; historical document extraction; indexation structures; query-by-string word spotting framework; statistical representation; sub-vector space; textual cues; textual query; textual representation; visual cues; visual modality; visual representations; word images; word instances; Computational modeling; Computer vision; Hidden Markov models; Histograms; Pattern recognition; Vectors; Visualization; bag-of-visual-words; handwritten word spotting; latent semantic analysis; query-by-string;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Document Analysis and Recognition (ICDAR), 2013 12th International Conference on

Conference_Location :

Washington, DC

ISSN :

1520-5363

Type :

conf

DOI :

10.1109/ICDAR.2013.108

Filename :

6628673

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3487122