Title :
Word-Wise Script Identification from Video Frames
Author :
Sharma, Neelam ; Chanda, Sukalpa ; Pal, Umapada ; Blumenstein, Michael
Author_Institution :
Griffith Univ., Gold Coast, QLD, Australia
Abstract :
Script identification is an essential step for the efficient use of the appropriate OCR in multilingual document images. There are various techniques available for script identification from printed and handwritten document images, but script identification from video frames has not been explored much. This paper presents a study of some pre-processing techniques and features for word-wise script identification from video frames. Traditional features, namely Zernike moments, Gabor and gradient, have performed well for handwritten and printed documents having simple backgrounds and adequate resolution for OCR. Video frames are mostly coloured and suffer from low resolution, blur, background noise, to mention a few. In this paper, an attempt has been made to explore whether the traditional script identification techniques can be useful in video frames. Three feature extraction techniques, namely Zernike moments, Gabor and gradient features, and SVM classifiers were considered for analyzing three popular scripts, namely English, Bengali and Hindi. Some pre-processing techniques such as super resolution and skeletonization of the original word images were used in order to overcome the inherent problems with video. Experiments show that the super resolution technique with gradient features has performed well, and an accuracy of 87.5% was achieved when testing on 896 words from three different scripts. The study also reveals that the use of proper pre-processing approaches can be helpful in applying traditional script identification techniques to video frames.
Keywords :
Zernike polynomials; document image processing; gradient methods; handwritten character recognition; image classification; image resolution; natural language processing; support vector machines; text detection; video signal processing; Bengali script; English script; Gabor features; Hindi script; OCR; SVM classifiers; Zernike moments; background noise; coloured video frames; feature extraction; gradient features; handwritten document images; multilingual document images; pre-processing technique; printed document images; super resolution technique; video resolution; word-wise script identification; Accuracy; Feature extraction; Image resolution; Image segmentation; Optical character recognition software; Skeleton; Support vector machines; Script identification; Video document analysis; Word segmentation;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.177