DocumentCode :
178984
Title :
Graphics and Scene Text Classification in Video
Author :
Jiamin Xu ; Shivakumara, P. ; Tong Lu ; Trung Quy Phan ; Chew Lim Tan
Author_Institution :
Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
4714
Lastpage :
4719
Abstract :
Achieving good accuracy for text detection and recognition is a challenging and interesting problem in the field of video document analysis because of the presences of both graphics text that has good clarity and scene text that is unpredictable in video frames. Therefore, in this paper, we present a novel method for classifying graphics texts and scene texts by exploiting temporal information and finding the relationship between them in video. The method proposes an iterative procedure to identify Probable Graphics Text Candidates (PGTC) and Probable Scene Text Candidates (PSTC) in video based on the fact that graphics texts in general do not have large movements especially compared to scene texts which are usually embedded on background. In addition to PGTC and PSTC, the iterative process automatically identifies the number of video frames with the help of a converging criterion. The method further explores the symmetry between intra and inter character components to identify graphics text candidates and scene text candidates. Boundary growing method is employed to restore the complete text line. For each segmented text line, we finally introduce Eigen value analysis to classify graphics and scene text lines based on the distribution of respective Eigen values. Experimental results with the existing methods show that the proposed method is effective and useful to improve the accuracy of text detection and recognition.
Keywords :
document image processing; eigenvalues and eigenfunctions; iterative methods; text detection; video signal processing; Eigen value analysis; PGTC; PSTC; boundary growing method; graphics classification; iterative process; probable graphics text candidates; probable scene text candidates; scene text classification; temporal information; text detection; text recognition; video document analysis; video frames; Accuracy; Graphics; Image edge detection; Image segmentation; Iterative methods; Text categorization; Text recognition; Eigen value analysis; Error estimation; Graphics and scene text classification; K-means clustering; Temporal frames; Video text segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.806
Filename :
6977519
Link To Document :
بازگشت