• DocumentCode
    2486427
  • Title

    Caption Localization and Detection for News Videos Using Frequency Analysis and Wavelet Features

  • Author

    Lee, Chien-Cheng ; Chiang, Yu-Chun ; Shih, Cheng-Yuan ; Huang, Hau-Ming

  • Author_Institution
    Yuan Ze Univ., Chung-li
  • Volume
    2
  • fYear
    2007
  • fDate
    29-31 Oct. 2007
  • Firstpage
    539
  • Lastpage
    539
  • Abstract
    In this paper, we propose an algorithm to detect captions from news videos. The propose method only detects captions excluding other miscellaneous types of text. The algorithm makes use of the fact that the text remains in many consecutive frames to reduce the number of the processing frames. The caption beginning frame is detected first, then a caption candidate strip in the caption beginning frame is defined. Moreover, the difference of the caption candidate strip between consecutive frames is computed, and then the difference information is transformed to frequency domain by discrete cosine transform. Frequency analysis is used to define the caption candidate region, and twelve wavelet features are extracted from the region and considered as the input of the classifier to detect the text blocks. Experimental results show that the proposed approach can fast and robustly detect captions from news video.
  • Keywords
    character recognition; discrete cosine transforms; feature extraction; image classification; video signal processing; wavelet transforms; caption beginning frame; caption candidate strip; caption localization; discrete cosine transform; frequency analysis; news videos; text block classifier; wavelet feature extraction; Data mining; Discrete cosine transforms; Feature extraction; Frequency domain analysis; Indexing; Information retrieval; Redundancy; Strips; Videos; Wavelet analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
  • Conference_Location
    Patras
  • ISSN
    1082-3409
  • Print_ISBN
    978-0-7695-3015-4
  • Type

    conf

  • DOI
    10.1109/ICTAI.2007.157
  • Filename
    4410436