DocumentCode :
2480710
Title :
Gaussian Mixture Models for Arabic Font Recognition
Author :
Slimane, Fouad ; Kanoun, Slim ; Alimi, Adel M. ; Ingold, Rolf ; Hennebert, Jean
Author_Institution :
Dept. of Inf., Univ. of Fribourg (unifr), Fribourg, Switzerland
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
2174
Lastpage :
2177
Abstract :
We present in this paper a new approach for Arabic font recognition. Our proposal is to use a fixed-length sliding window for the feature extraction and to model feature distributions with Gaussian Mixture Models (GMMs). This approach presents a double advantage. First, we do not need to perform a priori segmentation into characters, which is a difficult task for arabic text. Second, we use versatile and powerful GMMs able to model finely distributions of features in large multi-dimensional input spaces. We report on the evaluation of our system on the APTI (Arabic Printed Text Image) database using 10 different fonts and 10 font sizes. Considering the variability of the different font shapes and the fact that our system is independent of the font size, the obtained results are convincing and compare well with competing systems.
Keywords :
Gaussian processes; image segmentation; natural language processing; optical character recognition; text analysis; APTI database; Arabic font recognition; Arabic printed text image; Gaussian mixture models; a priori segmentation; feature extraction; model feature distributions; optical character recognition; Computational modeling; Databases; Feature extraction; Hidden Markov models; Shape; Text recognition; Training; Font recognition; GMM; HMM; OCR;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.532
Filename :
5595946
Link To Document :
بازگشت