DocumentCode :
3145562
Title :
A discriminative learning approach for orientation detection of Urdu document images
Author :
Rashid, Sheikh Faisal ; Bukhari, Syed Saqib ; Shafait, Faisal ; Breuel, Thomas M.
Author_Institution :
Image Understanding & Pattern Recognition (IUPR), Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
fYear :
2009
fDate :
14-15 Dec. 2009
Firstpage :
1
Lastpage :
5
Abstract :
Orientation detection is an important preprocessing step for accurate recognition of text from document images. Many existing orientation detection techniques are based on the fact that in Roman script text ascenders occur more likely than descenders, but this approach is not applicable to document of other scripts like Urdu, Arabic, etc. In this paper, we propose a discriminative learning approach for orientation detection of Urdu documents with varying layouts and fonts. The main advantage of our approach is that it can be applied to documents of other scripts easily and accurately. Our approach is based on classification of individual connected component orientation in the document image, and then the orientation of the page image is determined via majority count. A convolutional neural network is trained as discriminative learning model for the labeled Urdu books dataset with four target orientations: 0, 90, 180 and 270 degrees. We demonstrate the effectiveness of our method on dataset of Urdu documents categorized into the layouts of book, novel and poetry. We achieved 100% orientation detection accuracy on a test set of 328 document images.
Keywords :
classification; document image processing; learning (artificial intelligence); natural language processing; neural nets; text analysis; Roman script; Urdu books dataset; Urdu document images; classification; convolutional neural network; discriminative learning approach; orientation detection; text ascenders; text recognition; Artificial intelligence; Books; Cellular neural networks; Image recognition; Learning; Neural networks; Optical character recognition software; Pattern recognition; Shape; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multitopic Conference, 2009. INMIC 2009. IEEE 13th International
Conference_Location :
Islamabad
Print_ISBN :
978-1-4244-4872-2
Electronic_ISBN :
978-1-4244-4873-9
Type :
conf
DOI :
10.1109/INMIC.2009.5383110
Filename :
5383110
Link To Document :
بازگشت