DocumentCode :
2013697
Title :
Appearance Based Models in Document Script Identification
Author :
Vikram, T.N. ; Guru, D.S.
Author_Institution :
Univ. of Mysore, Mysore
Volume :
2
fYear :
2007
fDate :
23-26 Sept. 2007
Firstpage :
709
Lastpage :
713
Abstract :
In this paper we employ appearance based models for document script identification. They are employed to identify scripts at both paragraph and word level. Elaborate experimentation has been conducted which has revealed that they are robust enough to handle highly confusing scripts and their performance does not degrade drastically even in the presence of noise. A generic script identification has been attempted, to identify both Asian and European scripts by considering a dataset of twenty different languages.
Keywords :
document image processing; natural language processing; Asian scripts; European scripts; appearance based models; confusing scripts; document script identification; generic script identification; Automation; Character recognition; Computer science; Covariance matrix; Degradation; Europe; Information management; Noise robustness; Principal component analysis; Sorting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
ISSN :
1520-5363
Print_ISBN :
978-0-7695-2822-9
Type :
conf
DOI :
10.1109/ICDAR.2007.4377007
Filename :
4377007
Link To Document :
بازگشت