• DocumentCode
    178984
  • Title

    Graphics and Scene Text Classification in Video

  • Author

    Jiamin Xu ; Shivakumara, P. ; Tong Lu ; Trung Quy Phan ; Chew Lim Tan

  • Author_Institution
    Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    4714
  • Lastpage
    4719
  • Abstract
    Achieving good accuracy for text detection and recognition is a challenging and interesting problem in the field of video document analysis because of the presences of both graphics text that has good clarity and scene text that is unpredictable in video frames. Therefore, in this paper, we present a novel method for classifying graphics texts and scene texts by exploiting temporal information and finding the relationship between them in video. The method proposes an iterative procedure to identify Probable Graphics Text Candidates (PGTC) and Probable Scene Text Candidates (PSTC) in video based on the fact that graphics texts in general do not have large movements especially compared to scene texts which are usually embedded on background. In addition to PGTC and PSTC, the iterative process automatically identifies the number of video frames with the help of a converging criterion. The method further explores the symmetry between intra and inter character components to identify graphics text candidates and scene text candidates. Boundary growing method is employed to restore the complete text line. For each segmented text line, we finally introduce Eigen value analysis to classify graphics and scene text lines based on the distribution of respective Eigen values. Experimental results with the existing methods show that the proposed method is effective and useful to improve the accuracy of text detection and recognition.
  • Keywords
    document image processing; eigenvalues and eigenfunctions; iterative methods; text detection; video signal processing; Eigen value analysis; PGTC; PSTC; boundary growing method; graphics classification; iterative process; probable graphics text candidates; probable scene text candidates; scene text classification; temporal information; text detection; text recognition; video document analysis; video frames; Accuracy; Graphics; Image edge detection; Image segmentation; Iterative methods; Text categorization; Text recognition; Eigen value analysis; Error estimation; Graphics and scene text classification; K-means clustering; Temporal frames; Video text segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.806
  • Filename
    6977519