Title :
Scene Text Segmentation with Multi-level Maximally Stable Extremal Regions
Author :
Shangxuan Tian ; Shijian Lu ; Bolan Su ; Chew Lim Tan
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
The segmentation of scene text from the image background has shown great importance in scene text recognition. In this paper, we propose a multi-level MSER technology that identifies the best-quality text candidates from a set of stable regions that are extracted from different color channel images. In order to identify the best-quality text candidates, a segmentation score is defined which exploits four measures to evaluate the text probability of each stable region including: 1) Stroke width that measures the small stroke width variation of the text, 2) Boundary curvature that measures the smoothness of the stable region boundary, 3) Character confidence that measures the likelihood of a stable region being text based on a pre-trained support vector classifier, 4) Color constancy that measures the global color consistency of each selected text candidate. Finally, the MSERs with the best segmentation score from each channel are combined to form the final segmentation. The proposed method is evaluated on the ICDAR2003 and SVT datasets and experiments show that it outperforms both popular document image binarization methods and state of the art scene text segmentation methods.
Keywords :
document image processing; image colour analysis; image recognition; image segmentation; support vector machines; text detection; ICDAR2003 dataset; SVT dataset; boundary curvature; character confidence; color channel images; color constancy; document image binarization method; global color consistency; image background; multilevel MSER technology; multilevel maximally stable extremal regions; pretrained support vector classifier; scene text recognition; scene text segmentation method; segmentation score; stable region boundary; stroke width variation; text probability; Feature extraction; Image color analysis; Image segmentation; Lighting; Optical character recognition software; Robustness; Text recognition;
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
DOI :
10.1109/ICPR.2014.467