Title :
Gaussian Mixture Models for Arabic Font Recognition
Author :
Slimane, Fouad ; Kanoun, Slim ; Alimi, Adel M. ; Ingold, Rolf ; Hennebert, Jean
Author_Institution :
Dept. of Inf., Univ. of Fribourg (unifr), Fribourg, Switzerland
Abstract :
We present in this paper a new approach for Arabic font recognition. Our proposal is to use a fixed-length sliding window for the feature extraction and to model feature distributions with Gaussian Mixture Models (GMMs). This approach presents a double advantage. First, we do not need to perform a priori segmentation into characters, which is a difficult task for arabic text. Second, we use versatile and powerful GMMs able to model finely distributions of features in large multi-dimensional input spaces. We report on the evaluation of our system on the APTI (Arabic Printed Text Image) database using 10 different fonts and 10 font sizes. Considering the variability of the different font shapes and the fact that our system is independent of the font size, the obtained results are convincing and compare well with competing systems.
Keywords :
Gaussian processes; image segmentation; natural language processing; optical character recognition; text analysis; APTI database; Arabic font recognition; Arabic printed text image; Gaussian mixture models; a priori segmentation; feature extraction; model feature distributions; optical character recognition; Computational modeling; Databases; Feature extraction; Hidden Markov models; Shape; Text recognition; Training; Font recognition; GMM; HMM; OCR;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.532