DocumentCode
71565
Title
Scene Text Detection via Connected Component Clustering and Nontext Filtering
Author
Hyung Il Koo ; Duck Hoon Kim
Author_Institution
Div. of Electr. & Comput. Eng., Ajou Univ., Suwon, South Korea
Volume
22
Issue
6
fYear
2013
fDate
Jun-13
Firstpage
2296
Lastpage
2305
Abstract
In this paper, we present a new scene text detection algorithm based on two machine learning classifiers: one allows us to generate candidate word regions and the other filters out nontext ones. To be precise, we extract connected components (CCs) in images by using the maximally stable extremal region algorithm. These extracted CCs are partitioned into clusters so that we can generate candidate regions. Unlike conventional methods relying on heuristic rules in clustering, we train an AdaBoost classifier that determines the adjacency relationship and cluster CCs by using their pairwise relations. Then we normalize candidate word regions and determine whether each region contains text or not. Since the scale, skew, and color of each candidate can be estimated from CCs, we develop a text/nontext classifier for normalized images. This classifier is based on multilayer perceptrons and we can control recall and precision rates with a single free parameter. Finally, we extend our approach to exploit multichannel information. Experimental results on ICDAR 2005 and 2011 robust reading competition datasets show that our method yields the state-of-the-art performance both in speed and accuracy.
Keywords
heuristic programming; image classification; multilayer perceptrons; text detection; CC; ICDAR 2005; ICDAR 2011; component clustering; heuristic rules; machine learning classifiers; multichannel information; multilayer perceptrons; nontext filtering; normalized images; pairwise relations; scene text detection algorithm; text-nontext classifier; Algorithm design and analysis; Clustering algorithms; Feature extraction; Image color analysis; Partitioning algorithms; Robustness; Training; CC clustering; connected component (CC)-based approach; machine learning classifier; nontext filtering; scene text detection; Algorithms; Artificial Intelligence; Cluster Analysis; Databases, Factual; Image Processing, Computer-Assisted; Pattern Recognition, Automated;
fLanguage
English
Journal_Title
Image Processing, IEEE Transactions on
Publisher
ieee
ISSN
1057-7149
Type
jour
DOI
10.1109/TIP.2013.2249082
Filename
6471224
Link To Document