Caption Text Extraction Using DCT Feature in MPEG Compressed Video

Author

Xu, Jiangbo ; Jiang, Xiuhua ; Wang, Yuxia

Author_Institution

Inf. Eng. Sch., Commun. Univ. of China, Beijing, China

Volume

6

fYear

2009

fDate

March 31 2009-April 2 2009

Firstpage

431

Lastpage

434

Abstract

Caption text provides valuable information about contents in video sequences. In this paper, an efficient method to locate candidate caption text regions of video directly in the DCT compressed domain is proposed. Candidate text blocks are detected in terms of DCT texture energy. A 3 times 3 median filter is used as spatial constraint to refine the text regions. An adaptive temporal constraint method is designed according to the same caption text last for at least two seconds. Finally we convert the extracted text regions into HSV color space to generate binary text images that required by commercial OCRs. Experimental results on several video sequences show that the proposed algorithm is efficient to detect and extract caption text in MPEG video sequences with various scene complexities.

Keywords

data compression; discrete cosine transforms; feature extraction; image sequences; median filters; video coding; DCT feature; MPEG compressed video; adaptive temporal constraint method; binary text images; caption text extraction; median filter; spatial constraint; video sequences; Data mining; Design methodology; Discrete cosine transforms; Feature extraction; Filters; Image converters; Image generation; Transform coding; Video compression; Video sequences; Caption Text; Compressed Domain; DCT; Text extraction; Texture Energy;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Science and Information Engineering, 2009 WRI World Congress on

Conference_Location

Los Angeles, CA

Print_ISBN

978-0-7695-3507-4

Type

conf

DOI

10.1109/CSIE.2009.107

Filename

5170735