DocumentCode
2016486
Title
A Blind Indic Script Recognizer for Multi-script Documents
Author
Pati, Peeta Basa ; Ramakrishnan, A.G.
Author_Institution
Indian Inst. of Sci., Bangalore
Volume
2
fYear
2007
fDate
23-26 Sept. 2007
Firstpage
1248
Lastpage
1252
Abstract
We report a hierarchical blind script identifier for 11 different Indian scripts. An initial grouping of the 11 scripts is accomplished at the first level of this hierarchy. At the subsequent level, we recognize the script in each group. The various nodes of this tree use different feature-classifier combinations. A database of 20,000 words of different font styles and sizes is collected and used for each script. Effectiveness of Gabor and Discrete Cosine Transform features has been independently evaluated using nearest neighbor, linear discriminant and support vector machine classifiers. The minimum and maximum accuracies obtained, using this hierarchical mechanism, are 92.2% and 97.6%, respectively.
Keywords
discrete cosine transforms; document image processing; image classification; natural language processing; support vector machines; Gabor transform; blind Indie script recognizer; discrete cosine transform; feature-classifier combinations; linear discriminant classifiers; multi-script documents; nearest neighbor classifiers; support vector machine classifiers; Biomedical imaging; Discrete cosine transforms; Filter bank; Frequency; Gabor filters; Laboratories; Nearest neighbor searches; Spatial databases; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location
Parana
ISSN
1520-5363
Print_ISBN
978-0-7695-2822-9
Type
conf
DOI
10.1109/ICDAR.2007.4377115
Filename
4377115
Link To Document