DocumentCode :
3360705
Title :
Using text-spotting to query the world
Author :
Posner, Ingmar ; Corke, Peter ; Newman, Paul
Author_Institution :
Dept. of Eng. Sci., Mobile Robot. Group, Oxford Univ., Oxford, UK
fYear :
2010
fDate :
18-22 Oct. 2010
Firstpage :
3181
Lastpage :
3186
Abstract :
The world we live in is labeled extensively for the benefit of humans. Yet, to date, robots have made little use of human readable text as a resource. In this paper we aim to draw attention to text as a readily available source of semantic information in robotics by implementing a system which allows robots to read visible text in natural scene images and to use this knowledge to interpret the content of a given scene. The reliable detection and parsing of text in natural scene images is an active area of research and remains a non-trivial problem. We extend a commonly adopted approach based on boosting for the detection and optical character recognition (OCR) for the parsing of text by a probabilistic error correction scheme incorporating a sensor-model for our pipeline. In order to interpret the scene content we introduce a generative model which explains spotted text in terms of arbitrary search terms. This allows the robot to estimate the relevance of a given scene with respect to arbitrary queries such as, for example, whether it is looking at a bank or a restaurant. We present results from images recorded by a robot in a busy cityscape.
Keywords :
error correction; optical character recognition; robot vision; text analysis; cityscape; human readable text; natural scene images; optical character recognition; probabilistic error correction; robots; text parsing; text spotting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on
Conference_Location :
Taipei
ISSN :
2153-0858
Print_ISBN :
978-1-4244-6674-0
Type :
conf
DOI :
10.1109/IROS.2010.5653151
Filename :
5653151
Link To Document :
بازگشت