  • DocumentCode
    3476572
  • Title
    Temporally consistent caption detection in videos using a spatiotemporal 3D method
  • Author
    Zhang, Dong-Qing ; Bhagavathy, Sitaram ; Llach, Joan
  • Author_Institution
    Thomson Corp. Res., Princeton, NJ, USA
  • fYear
    2009
  • fDate
    7-10 Nov. 2009
  • Firstpage
    1881
  • Lastpage
    1884
  • Abstract
    Captions are text or logos superimposed on videos during a postproduction process. Caption detection in videos is useful for a variety of applications. For many applications, temporal consistency and stability are very important. Most prior work adopts post-processing procedures to smooth the detected caption bounding boxes over time. Although these approaches mitigate the temporal inconsistency problem, they are unable to eliminate it. In this paper, we present a new caption detection algorithm that detects the 3D bounding boxes of caption regions in spatiotemporal volume space. 2D bounding boxes are then created by slicing the 3D bounding boxes. Since all the 2D bounding boxes corresponding to a caption area are sliced from one 3D bounding box, they are identical over time, thus ensuring temporal consistency of the result. The experimental results show that our new approach not only generates temporally consistent results but also achieves higher detection accuracy.
  • Keywords
    text analysis; video signal processing; 3D bounding boxes; caption regions; logos; spatiotemporal 3D method; spatiotemporal volume space; temporal consistency; temporally consistent caption detection; video postproduction process; Detection algorithms; Feature extraction; Image edge detection; Indexing; Optical character recognition software; Pixel; Smoothing methods; Spatiotemporal phenomena; Stability; Videos; Caption detection; logo detection; spatiotemporal processing; video OCR; video text detection
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing (ICIP), 2009 16th IEEE International Conference on
  • Conference_Location
    Cairo
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4244-5653-6
  • Electronic_ISBN
    1522-4880
  • Type
    conf
  • DOI
    10.1109/ICIP.2009.5413544
  • Filename
    5413544
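
The slicing step described in the abstract can be illustrated with a minimal sketch. It assumes a 3D caption box has already been detected and is axis-aligned in (x, y, t) space. The names `Box3D` and `slice_box` are hypothetical and do not come from the paper, and the detection of the 3D box itself is not shown.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class Box3D:
    """Hypothetical axis-aligned caption box in (x, y, t) volume space."""
    x0: int
    y0: int
    x1: int
    y1: int
    t0: int  # first frame covered by the caption (inclusive)
    t1: int  # last frame covered by the caption (inclusive)


def slice_box(box: Box3D) -> Dict[int, Tuple[int, int, int, int]]:
    """Return one 2D box (x0, y0, x1, y1) for every frame the 3D box spans."""
    rect = (box.x0, box.y0, box.x1, box.y1)
    # Every frame receives the same rectangle, because all slices
    # come from a single 3D box.
    return {t: rect for t in range(box.t0, box.t1 + 1)}


# Illustrative values only, not data from the paper.
boxes = slice_box(Box3D(x0=40, y0=300, x1=600, y1=340, t0=10, t1=14))
```

Because each per-frame rectangle is taken from the same 3D box, the 2D boxes cannot drift or flicker between frames, which is the temporal consistency property the abstract claims.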