DocumentCode :
178400
Title :
Video Text Extraction Using the Fusion of Color Gradient and Log-Gabor Filter
Author :
Zhike Zhang ; Weiqiang Wang ; Ke Lu
Author_Institution :
Univ. of Chinese Acad. of Sci., Beijing, China
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
2938
Lastpage :
2943
Abstract :
Video text which contains rich semantic information can be utilized for video indexing and summarization. However, compared with scanned documents, text recogniton for video text is still a challenging problem due to complex background. Segmenting text line into single characters before text extraction can achieve higher recognition accuracy, since background of single character is less complex compared with whole text line. Therefore, we first perform character segmentation, which can accurately locate the character gap in the text line. More specifically, we get a fusion map which fuses the results of color gradient and log-gabor filter. Then, candidate segmentation points are obtained by vertical projection analysis of the fusion map. We get segmentation points by finding minimum projection value of candidate points in a limited range. Finally, we get the binary image of the single character image by applying K-means clustering and combine their results to form binary image of the whole text line. The binary image is further refined by inward filling and the fusion map. The experimental results on a large amount of data show that the proposed method can contribute to better binarization result which leads to a higher character recognition rate of OCR engine.
Keywords :
Gabor filters; image colour analysis; image fusion; image segmentation; indexing; optical character recognition; pattern clustering; text detection; video retrieval; K-means clustering; OCR engine; binary image; character segmentation; color gradient; fusion map; inward filling; log-Gabor filter; recognition accuracy; segmentation points; semantic information; single character image; text line segmentation; text recogniton; vertical projection analysis; video indexing; video summarization; video text extraction; Accuracy; Character recognition; Image color analysis; Image recognition; Image segmentation; Optical character recognition software; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.506
Filename :
6977219
Link To Document :
بازگشت