DocumentCode
3360705
Title
Using text-spotting to query the world
Author
Posner, Ingmar ; Corke, Peter ; Newman, Paul
Author_Institution
Dept. of Eng. Sci., Mobile Robot. Group, Oxford Univ., Oxford, UK
fYear
2010
fDate
18-22 Oct. 2010
Firstpage
3181
Lastpage
3186
Abstract
The world we live in is labeled extensively for the benefit of humans. Yet, to date, robots have made little use of human readable text as a resource. In this paper we aim to draw attention to text as a readily available source of semantic information in robotics by implementing a system which allows robots to read visible text in natural scene images and to use this knowledge to interpret the content of a given scene. The reliable detection and parsing of text in natural scene images is an active area of research and remains a non-trivial problem. We extend a commonly adopted approach based on boosting for the detection and optical character recognition (OCR) for the parsing of text by a probabilistic error correction scheme incorporating a sensor-model for our pipeline. In order to interpret the scene content we introduce a generative model which explains spotted text in terms of arbitrary search terms. This allows the robot to estimate the relevance of a given scene with respect to arbitrary queries such as, for example, whether it is looking at a bank or a restaurant. We present results from images recorded by a robot in a busy cityscape.
Keywords
error correction; optical character recognition; robot vision; text analysis; cityscape; human readable text; natural scene images; optical character recognition; probabilistic error correction; robots; text parsing; text spotting;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on
Conference_Location
Taipei
ISSN
2153-0858
Print_ISBN
978-1-4244-6674-0
Type
conf
DOI
10.1109/IROS.2010.5653151
Filename
5653151
Link To Document