DocumentCode :
2010772
Title :
New Spatial-Gradient-Features for Video Script Identification
Author :
Zhao, Danni ; Shivakumara, Palaiahnakote ; Lu, Shijian ; Tan, Chew Lim
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
fYear :
2012
fDate :
27-29 March 2012
Firstpage :
38
Lastpage :
42
Abstract :
In this paper, we present new features based on Spatial-Gradient-Features (SGF) at block level for identifying six video scripts namely, Arabic, Chinese, English, Japanese, Korean and Tamil. This works helps in enhancing the capability of the current OCR on video text recognition by choosing an appropriate OCR engine when video contains multi-script frames. The input for script identification is the text blocks obtained by our text frame classification method. For each text block, we obtain horizontal and vertical gradient information to enhance the contrast of the text pixels. We divide the horizontal gradient block into two equal parts as upper and lower at the centroid in the horizontal direction. Histogram on the horizontal gradient values of the upper and the lower part is performed to select dominant text pixels. In the same way, the method selects dominant pixels from the right and the left parts obtained by dividing the vertical gradient block vertically. The method combines the horizontal and the vertical dominant pixels to obtain text components. Skeleton concept is used to reduce pixel width to a single pixel to extract spatial features. We extract four features based on proximity between end points, junction points, intersection points and pixels. The method is evaluated on 770 frames of six scripts in terms of classification rate and is compared with an existing method. We have achieved 82.1% average classification rate.
Keywords :
gradient methods; natural language processing; optical character recognition; video signal processing; Arabic video scripts; Chinese video scripts; English video scripts; Japanese video scripts; Korean video scripts; OCR; Tamil video scripts; classification rate; gradient information; histogram; skeleton concept; spatial-gradient-features; text block; text components; text pixels; video script identification; video text recognition; Equations; Feature extraction; Histograms; Junctions; Optical character recognition software; Text recognition; Dominat video text pixels; Gradient blocks; Spatial-gradient-features; Video scrpt identification; Video text blocks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Cost, QLD
Print_ISBN :
978-1-4673-0868-7
Type :
conf
DOI :
10.1109/DAS.2012.57
Filename :
6195331
Link To Document :
بازگشت