Title :
Appearance Based Models in Document Script Identification
Author :
Vikram, T.N. ; Guru, D.S.
Author_Institution :
Univ. of Mysore, Mysore
Abstract :
In this paper we employ appearance based models for document script identification. They are employed to identify scripts at both paragraph and word level. Elaborate experimentation has been conducted which has revealed that they are robust enough to handle highly confusing scripts and their performance does not degrade drastically even in the presence of noise. A generic script identification has been attempted, to identify both Asian and European scripts by considering a dataset of twenty different languages.
Keywords :
document image processing; natural language processing; Asian scripts; European scripts; appearance based models; confusing scripts; document script identification; generic script identification; Automation; Character recognition; Computer science; Covariance matrix; Degradation; Europe; Information management; Noise robustness; Principal component analysis; Sorting;
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4377007