DocumentCode
2833810
Title
Gabor filters for document analysis in Indian bilingual documents
Author
Pati, Peeta Basa ; Raju, S. Sabari ; Pati, Nishikanta ; Ramakrishnan, A.G.
Author_Institution
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
fYear
2004
fDate
2004
Firstpage
123
Lastpage
126
Abstract
Reasonable success has been achieved in developing monolingual OCR systems for Indian scripts. Scientists, optimistically, have started to look beyond. The development of bilingual OCR systems, and of OCR systems capable of identifying text areas, are some of the pointers to future activities in the Indian scenario. Separating text and non-text regions before passing the document image to OCR is an important task. In this paper, we present a biologically inspired, multi-channel filtering scheme for page layout analysis. The same scheme has been used for script recognition as well. Parameter tuning is mostly done heuristically. The scheme has also been found to be computationally viable for commercial OCR system development.
Keywords
document image processing; filtering theory; natural languages; optical character recognition; text analysis; Gabor filters; Indian bilingual documents; Indian scripts; bilingual OCR systems; document analysis; document image; monolingual OCR systems; multichannel filtering; nontext region separation; optical character recognition systems; page layout analysis; parameter tuning; script recognition; text areas identification; text region separation; Biological system modeling; Character recognition; Databases; Educational institutions; Filtering; Gabor filters; Natural languages; Optical character recognition software; Text analysis; Text recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
Print_ISBN
0-7803-8243-9
Type
conf
DOI
10.1109/ICISIP.2004.1287637
Filename
1287637