• DocumentCode
    3487379
  • Title

    Recognition of Video Text through Temporal Integration

  • Author

    Phan, Trung Quy ; Shivakumara, Palaiahnakote ; Tong Lu ; Tan, Chew Lim

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    589
  • Lastpage
    593
  • Abstract
    This paper presents a method for temporal integration, which can be used to improve the recognition accuracy of video texts. Given a word detected in a video frame, we use a combination of Stroke Width Transform and SIFT (Scale Invariant Feature Transform) to track it both backward and forward in time. The text instances within the word´s frame span are then extracted and aligned at pixel level. In the second step, we integrate these instances into a text probability map. By thresholding this map, we obtain an initial binarization of the word. In the final step, the shapes of the characters are refined using the intensity values. This helps to preserve the distinctive character features (e.g., sharp edges and holes), which are useful for OCR engines to distinguish between the different character classes. Experiments on English and German videos show that the proposed method outperforms existing ones in terms of recognition accuracy.
  • Keywords
    optical character recognition; probability; text detection; transforms; video signal processing; English videos; German videos; OCR engines; SIFT; intensity values; scale invariant feature transform; stroke width transform; temporal integration; text probability map; video frame; video text recognition; word initial binarization; Accuracy; Character recognition; Engines; Feature extraction; Optical character recognition software; Shape; Text recognition; SIFT; Stroke Width Transform; multiple frame integration; temporal integration; text binarization; text enhancement; text probability; text tracking; video text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.122
  • Filename
    6628687