DocumentCode :
1762990
Title :
Word Spotting and Recognition with Embedded Attributes
Author :
Almazan, Jon ; Gordo, Albert ; Fornes, Alicia ; Valveny, Ernest
Author_Institution :
Comput. Vision Center, Univ. Autonoma de Barcelona, Barcelona, Spain
Volume :
36
Issue :
12
fYear :
2014
fDate :
Dec. 1 2014
Firstpage :
2552
Lastpage :
2566
Abstract :
This paper addresses the problems of word spotting and word recognition on images. In word spotting, the goal is to find all instances of a query word in a dataset of images. In recognition, the goal is to recognize the content of the word image, usually aided by a dictionary or lexicon. We describe an approach in which both word images and text strings are embedded in a common vectorial subspace. This is achieved by a combination of label embedding and attributes learning, and a common subspace regression. In this subspace, images and strings that represent the same word are close together, allowing one to cast recognition and retrieval tasks as a nearest neighbor problem. Contrary to most other existing methods, our representation has a fixed length, is low dimensional, and is very fast to compute and, especially, to compare. We test our approach on four public datasets of both handwritten documents and natural images, showing results comparable to or better than the state of the art on spotting and recognition tasks.
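The nearest neighbor formulation described in the abstract can be illustrated with a minimal sketch. It is not the authors' representation: the letter-presence vector built by `embed_string` is a toy stand-in chosen only for illustration, the image vectors are assumed to already live in the same space, and the names `embed_string`, `cosine`, `recognize`, and `spot` are hypothetical.

```python
import math
import string

ALPHABET = string.ascii_lowercase


def embed_string(word):
    """Toy stand-in embedding (an assumption, not the paper's):
    one binary entry per letter of the alphabet present in the word."""
    letters = set(word.lower())
    return [1.0 if ch in letters else 0.0 for ch in ALPHABET]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def recognize(image_vec, lexicon):
    """Recognition: return the lexicon word whose embedding is nearest
    to the (already embedded) word image."""
    return max(lexicon, key=lambda w: cosine(image_vec, embed_string(w)))


def spot(query, image_vecs):
    """Spotting: rank image indices by similarity to the query string."""
    q = embed_string(query)
    return sorted(
        range(len(image_vecs)),
        key=lambda i: cosine(q, image_vecs[i]),
        reverse=True,
    )


# Hypothetical usage: a slightly noisy "image" embedding of one word.
noisy_image = embed_string("barcelona")
noisy_image[ALPHABET.index("z")] = 0.3  # simulated embedding noise
best_word = recognize(noisy_image, ["barcelona", "vision", "word"])
ranking = spot("barcelona", [embed_string("vision"), noisy_image, embed_string("word")])
```

Because every word, whether image or string, maps to a vector of the same fixed length, both tasks reduce to comparing vectors, which is the property the abstract emphasizes.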
Keywords :
document image processing; handwritten character recognition; query processing; text analysis; attributes learning; cast recognition; common subspace regression; dictionary; embedded attributes; handwritten documents; label embedding; lexicon; natural images; nearest neighbor problem; public datasets; query word; retrieval tasks; text strings; vectorial subspace; word image; word recognition; word spotting; Character recognition; Handwriting recognition; Hidden Markov models; Histograms; Image recognition; Nearest neighbor searches; Text recognition; Word image representation; attribute-based representation; handwritten text; scene text; word recognition; word spotting;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
IEEE
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2014.2339814
Filename :
6857995