DocumentCode :
1470706
Title :
Text String Detection From Natural Scenes by Structure-Based Partition and Grouping
Author :
Yi, Chucai ; Tian, YingLi
Author_Institution :
Grad. Center, City Univ. of New York, New York, NY, USA
Volume :
20
Issue :
9
fYear :
2011
Firstpage :
2594
Lastpage :
2605
Abstract :
Text information in natural scene images serves as important clues for many image-based applications such as scene understanding, content-based image retrieval, assistive navigation, and automatic geocoding. However, locating text from a complex background with multiple colors is a challenging task. In this paper, we explore a new framework to detect text strings with arbitrary orientations in complex natural scene images. Our proposed framework of text string detection consists of two steps: 1) image partition to find text character candidates based on local gradient features and color uniformity of character components and 2) character candidate grouping to detect text strings based on joint structural features of text characters in each text string such as character size differences, distances between neighboring characters, and character alignment. By assuming that a text string has at least three characters, we propose two algorithms of text string detection: 1) adjacent character grouping method and 2) text line grouping method. The adjacent character grouping method calculates the sibling groups of each character candidate as string segments and then merges the intersecting sibling groups into text string. The text line grouping method performs Hough transform to fit text line among the centroids of text candidates. Each fitted text line describes the orientation of a potential text string. The detected text string is presented by a rectangle region covering all characters whose centroids are cascaded in its text line. To improve efficiency and accuracy, our algorithms are carried out in multi-scales. The proposed methods outperform the state-of-the-art results on the public Robust Reading Dataset, which contains text only in horizontal orientation. Furthermore, the effectiveness of our methods to detect text strings with arbitrary orientations is evaluated on the Oriented Scene Text Dataset collected by ourselves containing text strings in nonhorizontal ori- ntations.
Keywords :
Hough transforms; character recognition; image colour analysis; object detection; Hough transform; adjacent character grouping method; assistive navigation; automatic geocoding; character candidate grouping; color uniformity; content-based image retrieval; image partition; image-based applications; natural scene images; natural scenes; oriented scene text dataset; public robust reading dataset; structure-based grouping; structure-based partition; text line grouping method; text string detection; Algorithm design and analysis; Feature extraction; Image color analysis; Image edge detection; Joining processes; Partitioning algorithms; Pixel; Adjacent character grouping; character property; image partition; text line grouping; text string detection; text string structure;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2011.2126586
Filename :
5729827
Link To Document :
بازگشت