DocumentCode :
2833810
Title :
Gabor filters for document analysis in Indian bilingual documents
Author :
Pati, Peeta Basa ; Raju, S. Sabari ; Pati, Nishikanta ; Ramakrishnan, A.G.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
fYear :
2004
fDate :
2004
Firstpage :
123
Lastpage :
126
Abstract :
Reasonable success has been achieved at developing monolingual OCR systems in Indian scripts. Scientists, optimistically, have started to look beyond. Development of bilingual OCR systems and OCR systems with capability to identify the text areas are some of the pointers to future activities in Indian scenario. The separation of text and non-text regions before considering the document image for OCR is an important task. In this paper, we present a biologically inspired, multi-channel filtering scheme for page layout analysis. The same scheme has been used for script recognition as well. Parameter tuning is mostly done heuristically. It has also been seen to be computationally viable for commercial OCR system development.
Keywords :
document image processing; filtering theory; natural languages; optical character recognition; text analysis; Gabor filters; Indian bilingual documents; Indian scripts; bilingual OCR systems; document analysis; document image; monolingual OCR systems; multichannel filtering; nontext region separation; optical character recognition systems; page layout analysis; parameter tuning; script recognition; text areas identification; text region separation; Biological system modeling; Character recognition; Databases; Educational institutions; Filtering; Gabor filters; Natural languages; Optical character recognition software; Text analysis; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
Print_ISBN :
0-7803-8243-9
Type :
conf
DOI :
10.1109/ICISIP.2004.1287637
Filename :
1287637
Link To Document :
بازگشت