Title :
Text detection on camera acquired document images using supervised classification of connected components in wavelet domain
Author :
Roy, Utpal ; Harit, Gaurav
Author_Institution :
Indian Inst. of Technol. Rajasthan, Jodhpur, India
Abstract :
In this paper we present an algorithm to detect text on video frames consisting of lecture slides. We begin by performing a multi-channel wavelet transform and then merge the channel components for the high frequency sub bands to obtain a composite energy map. Thresholding the energy map results in an edge map consisting of candidate text pixels - some of these correspond to actual text and others correspond to graphics, logo, tables, etc. The connected components in the edge map are then filtered to reject some of the false positives using a trained classifier. Rectangular text blocks compactly surrounding the text regions are then identified using a process of selective dilation and recursive splitting. False positive text blocks still remaining are then rejected using heuristics. Experiments conducted on 890 images show that our scheme has lower false positive rate and misdetection rate when compared with two existing scene text detection methods.
Keywords :
character recognition; document image processing; filtering theory; image classification; image segmentation; video signal processing; wavelet transforms; camera; candidate text pixel; channel component; classifier training; composite energy map; connected component filtering; document image; edge map; energy map thresholding; heuristics; lecture slide; multichannel wavelet transform; rectangular text block identification; recursive splitting; selective dilation; supervised classification; text detection; video frame; wavelet domain; Cameras; Feature extraction; Image edge detection; Kernel; Noise; Wavelet transforms;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4