Title :
An Improved Online Tamil Character Recognition Engine Using Post-Processing Methods
Author :
Sundaram, Suresh ; Ramakrishnan, A.G.
Author_Institution :
Indian Inst. of Sci., Bangalore, India
Abstract :
We propose script-specific post processing schemes for improving the recognition rate of online Tamil characters. At the first level, features derived at each sample point of the preprocessed character are used to construct a subspace using the 2DPCA algorithm. Recognition of the test sample is performed using a nearest neighbor classifier. Based on the analysis of the confusion matrix, multiple pairs of confused characters are identified. At the second level, we use script specific cues to sort out the ambiguities among the confused characters. This strategy reduces the recognition error among the confused character sets handled, by more than 4%. This approach can be applied irrespective of the nature of the classifier used for the first level of recognition, though the nature of the confusion set might vary.
Keywords :
character recognition; image classification; matrix algebra; natural languages; principal component analysis; 2DPCA algorithm; confusion matrix analysis; nearest neighbor classifier; online Tamil character recognition engine; recognition error reduction; script-specific post-processing method; Character recognition; Concatenated codes; Engines; Feature extraction; Nearest neighbor searches; Performance evaluation; Polynomials; Structural shapes; Testing; Text analysis; 2DPCA; Interest Points; Post Processing;
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2009.65