Title :
Caption Text Extraction Using DCT Feature in MPEG Compressed Video
Author :
Xu, Jiangbo ; Jiang, Xiuhua ; Wang, Yuxia
Author_Institution :
Inf. Eng. Sch., Commun. Univ. of China, Beijing, China
fDate :
March 31 2009-April 2 2009
Abstract :
Caption text provides valuable information about contents in video sequences. In this paper, an efficient method to locate candidate caption text regions of video directly in the DCT compressed domain is proposed. Candidate text blocks are detected in terms of DCT texture energy. A 3 times 3 median filter is used as spatial constraint to refine the text regions. An adaptive temporal constraint method is designed according to the same caption text last for at least two seconds. Finally we convert the extracted text regions into HSV color space to generate binary text images that required by commercial OCRs. Experimental results on several video sequences show that the proposed algorithm is efficient to detect and extract caption text in MPEG video sequences with various scene complexities.
Keywords :
data compression; discrete cosine transforms; feature extraction; image sequences; median filters; video coding; DCT feature; MPEG compressed video; adaptive temporal constraint method; binary text images; caption text extraction; median filter; spatial constraint; video sequences; Data mining; Design methodology; Discrete cosine transforms; Feature extraction; Filters; Image converters; Image generation; Transform coding; Video compression; Video sequences; Caption Text; Compressed Domain; DCT; Text extraction; Texture Energy;
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
DOI :
10.1109/CSIE.2009.107