DocumentCode :
3776027
Title :
Video text detection with text edges and convolutional neural network
Author :
Ping Hu;Weiqiang Wang;Ke Lu
Author_Institution :
University of Chinese, Academy of Sciences, Beijing, China
fYear :
2015
Firstpage :
675
Lastpage :
679
Abstract :
Text and captions in videos provide useful information for content analysis and understanding. In this paper, we present an approach to detecting video text in a coarse-to-fine strategy. In the coarse phase we propose an efficient method to detect multi-scale candidate text regions with high recall. Then the candidate text regions are segmented and sent to the fine phase where a convolutional neural network(CNN) is applied to generate a confidence map for each candidate text region. Finally, the candidate text regions are further refined and partitioned into text lines by projection analysis. The CNN classifier in the fine phase enables feature sharing and robustly identifies text regions. The coarse phase sharply reduce the number of windows needed to be scanned by the CNN. The combination endows the proposed method with both efficiency and robustness when detecting video text. It was verified by experiment results on two publicly testing datasets and a dataset created by us.
Keywords :
"Image edge detection","Feature extraction","Robustness","Training","TV","Neural networks","Image color analysis"
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ACPR), 2015 3rd IAPR Asian Conference on
Electronic_ISBN :
2327-0985
Type :
conf
DOI :
10.1109/ACPR.2015.7486588
Filename :
7486588
Link To Document :
بازگشت