DocumentCode
112705
Title
Multi-Orientation Scene Text Detection with Adaptive Clustering
Author
Yin, Xu-Cheng ; Pei, Wei-Yi ; Zhang, Jun ; Hao, Hong-Wei
Author_Institution
Department of Computer Science and Technology and also with the Beijing Key Laboratory of Materials Science Knowledge Engineering, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Volume
37
Issue
9
fYear
2015
fDate
Sept. 1 2015
Firstpage
1930
Lastpage
1937
Abstract
Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks, while most current research efforts only focus on horizontal or near horizontal scene text. In this paper, first we present a unified distance metric learning framework for adaptive hierarchical clustering, which can simultaneously learn similarity weights (to adaptively combine different feature similarities) and the clustering threshold (to automatically determine the number of clusters). Then, we propose an effective multi-orientation scene text detection system, which constructs text candidates by grouping characters based on this adaptive clustering. Our text candidates construction method consists of several sequential coarse-to-fine grouping steps: morphology-based grouping via single-link clustering, orientation-based grouping via divisive hierarchical clustering, and projection-based grouping also via divisive clustering. The effectiveness of our proposed system is evaluated on several public scene text databases, e.g., ICDAR Robust Reading Competition data sets (2011 and 2013), MSRA-TD500 and NEOCR. Specifically, on the multi-orientation text data set MSRA-TD500, the
measure of our system is
percent, much better than the state-of-the-art performance. We also construct and release a practical challenging multi-orientation scene text data set (USTB-SV1K), which is available at http://prir.ustb.edu.cn/TexStar/MOMV-text-detection/.
Keywords
Clustering algorithms; Equations; Image color analysis; Mathematical model; Measurement; Morphology; Robustness; Scene text detection; adaptive hierarchical clustering; coarse-to-fine grouping; multi-orientation;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2014.2388210
Filename
7001081
Link To Document