DocumentCode :
595429
Title :
Wavelet-gradient-fusion for video text binarization
Author :
Roy, Sandip ; Shivakumara, Palaiahnakote ; Roy, Partha Pratim ; Chew Lim Tan
Author_Institution :
Tata Consultancy Services, Kolkata, India
fYear :
2012
fDate :
11-15 Nov. 2012
Firstpage :
3300
Lastpage :
3303
Abstract :
Achieving good character recognition rate in video images is not as easy as achieving the same from the scanned documents because of low resolution and complex background in video images. In this paper, we propose a new method using fusion of horizontal, vertical and diagonal information obtained by the wavelet and the gradient on text line images to enhance the text information. We apply k-means with k=2 on row-wise and column-wise pixels separately to extract possible text information. The union operation on row-wise and column-wise clusters provides the text candidates information. With the help of Canny of the input image, the method identifies the disconnections based on mutual nearest neighbor criteria on end points and it compares the disconnected area with the text candidates to restore the missing information. Next, the method uses connected component analysis to merge some subcomponents based on nearest neighbor criteria. The foreground (text) and background (non-text) is separated based on new observation that the color values at edge pixel of the components are larger than the color values of the pixel inside the component. Finally, we use Google Tesseract OCR to validate our results and the results are compared with the baseline thresholding techniques to show that the proposed method is superior to existing methods in terms of recognition rate on 236 video and 258 ICDAR 2003 text lines.
Keywords :
gradient methods; image colour analysis; image fusion; optical character recognition; pattern clustering; text analysis; video signal processing; wavelet transforms; Google Tesseract OCR; ICDAR 2003 text lines; baseline thresholding techniques; character recognition rate; color values; column-wise clusters; column-wise pixels; diagonal information; edge pixel; horizontal information; k-means; nearest neighbor criteria; row-wise clusters; row-wise pixels; text candidates information; text line images; vertical information; video images; video text binarization; wavelet-gradient-fusion; Character recognition; Engines; Image color analysis; Image edge detection; Image restoration; Optical character recognition software; Text recognition; Video Video text restoration; Video character rcognition; Video text lines; Wavelet-Gradient-Fusion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
ISSN :
1051-4651
Print_ISBN :
978-1-4673-2216-4
Type :
conf
Filename :
6460870
Link To Document :
بازگشت