• DocumentCode
    1343213
  • Title

    Automatic caption localization in compressed video

  • Author

    Zhong, Yu ; Zhang, Hongjiang ; Jain, Anil K.

  • Author_Institution
    Robotics Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    22
  • Issue
    4
  • fYear
    2000
  • fDate
    4/1/2000 12:00:00 AM
  • Firstpage
    385
  • Lastpage
    392
  • Abstract
    We present a method to automatically localize captions in JPEG compressed images and the I-frames of MPEG compressed videos. Caption text regions are segmented from background images using their distinguishing texture characteristics. Unlike previously published methods which fully decompress the video sequence before extracting the text regions, this method locates candidate caption text regions directly in the DCT compressed domain using the intensity variation information encoded in the DCT domain. Therefore, only a very small amount of decoding is required. The proposed algorithm takes about 0.006 second to process a 240×350 image and achieves a recall rate of 99.17 percent while falsely accepting about 1.87 percent nontext DCT blocks on a variety of MPEG compressed videos containing more than 2,300 I-frames
  • Keywords
    data compression; image segmentation; video coding; 0.006 s; 240 pixel; 350 pixel; 84 kpixel; DCT compressed domain; I-frames; JPEG compressed images; MPEG compressed videos; automatic caption localization; caption text regions; compressed video; intensity variation information; Data mining; Discrete cosine transforms; Image coding; Image edge detection; Image segmentation; Information retrieval; Layout; Transform coding; Video compression; Video sequences;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.845381
  • Filename
    845381