Title :
An efficient coarse-to-fine scheme for text detection in videos
Author :
Wang, Liuan ; Huang, Lin-Lin ; Wu, Yang
Author_Institution :
Sch. of Autom. Sci. & Electr. Eng., Beihang Univ., Beijing, China
Abstract :
To achieve fast and accurate text detection from videos, we propose an efficient coarse-to-fine scheme comprising three stages: key frame extraction, candidate text line detection and fine text detection. Key frames, which are assumed to carry texts, are extracted based on multi-threshold difference of color histogram (MDCH). From the key frames, candidate text lines are detected by morphological operations and connected component analysis. Sliding window classification is performed on the candidate text lines so as to detect refined text lines. We use two types of features: histogram of gradients (HOG) and local assembled binary (LAB), and two classifiers: Real Adaboost and polynomial neural network (PNN), for improving the classification accuracy. The effectiveness of the proposed method has been demonstrated by the experiment results on a large video dataset. Also, the benefits of key frame extraction and combining multiple features and classifiers have been justified.
Keywords :
image enhancement; neural nets; pattern classification; text detection; video signal processing; MDCH; PNN; candidate text line detection; coarse-to-fine scheme; connected component analysis; fine text detection; histogram of gradients; key frame extraction; local assembled binary; multithreshold difference of color histogram; polynomial neural network; real Adaboost; sliding window classification; text detection; Accuracy; Feature extraction; Histograms; Image color analysis; Image edge detection; Machine learning; Videos; coarse-to-fine scheme; key frame extraction; multi-classifier fusion; video text detection;
Conference_Titel :
Pattern Recognition (ACPR), 2011 First Asian Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-0122-1
DOI :
10.1109/ACPR.2011.6166605