Title :
Identifying Handwritten Text in Mixed Documents
Author :
Farooq, Faisal ; Sridharan, Karthik ; Govindaraju, Vengatesan
Author_Institution :
CEDAR, Buffalo Univ., Amherst, NY
Abstract :
In this paper we present a system for classification of machine printed and handwritten text in mixed documents. The classification is performed at the word level. We propose a feature extraction algorithm for each word image based on Gabor filters followed by classification using an expectation maximization (EM) based probabilistic neural network that reduces overfitting of training data. An overall precision of 94.62% was obtained for the Arabic script using the modified neural network. The accuracies obtained using a simple backpropagation neural network and an SVM were 83.33% and 90.26% respectively
Keywords :
Gabor filters; backpropagation; document image processing; expectation-maximisation algorithm; feature extraction; handwriting recognition; image classification; neural nets; probability; support vector machines; Arabic script; Gabor filters; SVM; backpropagation neural network; expectation maximization; feature extraction; handwritten text identification; machine printed classification; modified neural network; probabilistic neural network; Pattern recognition;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.676