Title :
A New Method for Arbitrarily-Oriented Text Detection in Video
Author :
Sharma, Nabin ; Shivakumara, Palaiahnakote ; Pal, Umapada ; Blumenstein, Michael ; Tan, Chew Lim
Author_Institution :
Griffith Univ., Brisbane, QLD, Australia
Abstract :
Text detection in video frames plays a vital role in enhancing the performance of information extraction systems because the text in video frames helps in indexing and retrieving video efficiently and accurately. This paper presents a new method for arbitrarily-oriented text detection in video, based on dominant text pixel selection, text representatives and region growing. The method uses gradient pixel direction and magnitude corresponding to Sobel edge pixels of the input frame to obtain dominant text pixels. Edge components in the Sobel edge map corresponding to dominant text pixels are then extracted and we call them text representatives. We eliminate broken segments of each text representatives to get candidate text representatives. Then the perimeter of candidate text representatives grows along the text direction in the Sobel edge map to group the neighboring text components which we call word patches. The word patches are used for finding the direction of text lines and then the word patches are expanded in the same direction in the Sobel edge map to group the neighboring word patches and to restore missing text information. This results in extraction of arbitrarily-oriented text from the video frame. To evaluate the method, we considered arbitrarily-oriented data, non-horizontal data, horizontal data, Hua´s data and ICDAR-2003 competition data (Camera images). The experimental results show that the proposed method outperforms the existing method in terms of recall and f-measure.
Keywords :
edge detection; feature extraction; text detection; video retrieval; Camera images; Hua data; ICDAR-2003 competition data; Sobel edge map extraction; Sobel edge pixels; arbitrarily-oriented data; arbitrarily-oriented text detection; broken segments; dominant text pixel selection; edge components; f-measure; gradient pixel direction; information extraction system; missing text information restoration; neighboring text component grouping; neighboring word patch grouping; nonhorizontal data; region growing; text line direction; text representatives; video frames; video indexing; video retrieval; word patches; Cameras; Classification algorithms; Educational institutions; Feature extraction; Image edge detection; Image resolution; Pattern recognition; Angular region growing; Arbitrarily-oriented text detection; Dominant text pixels; Gradient direction; Video text frame; Video text representative;
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Cost, QLD
Print_ISBN :
978-1-4673-0868-7