DocumentCode :
1636588
Title :
An Improved Online Tamil Character Recognition Engine Using Post-Processing Methods
Author :
Sundaram, Suresh ; Ramakrishnan, A.G.
Author_Institution :
Indian Inst. of Sci., Bangalore, India
fYear :
2009
Firstpage :
1216
Lastpage :
1220
Abstract :
We propose script-specific post processing schemes for improving the recognition rate of online Tamil characters. At the first level, features derived at each sample point of the preprocessed character are used to construct a subspace using the 2DPCA algorithm. Recognition of the test sample is performed using a nearest neighbor classifier. Based on the analysis of the confusion matrix, multiple pairs of confused characters are identified. At the second level, we use script specific cues to sort out the ambiguities among the confused characters. This strategy reduces the recognition error among the confused character sets handled, by more than 4%. This approach can be applied irrespective of the nature of the classifier used for the first level of recognition, though the nature of the confusion set might vary.
Keywords :
character recognition; image classification; matrix algebra; natural languages; principal component analysis; 2DPCA algorithm; confusion matrix analysis; nearest neighbor classifier; online Tamil character recognition engine; recognition error reduction; script-specific post-processing method; Character recognition; Concatenated codes; Engines; Feature extraction; Nearest neighbor searches; Performance evaluation; Polynomials; Structural shapes; Testing; Text analysis; 2DPCA; Interest Points; Post Processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
ISSN :
1520-5363
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2009.65
Filename :
5277637
Link To Document :
بازگشت