DocumentCode :
2717644
Title :
Entropy Quantifiers Useful for Establishing Equivalence between Text Document Images
Author :
Gowda, Sahana D. ; Nagabhushan, P.
Author_Institution :
Univ. of Mysore, Mysore
Volume :
3
fYear :
2007
fDate :
13-15 Dec. 2007
Firstpage :
420
Lastpage :
425
Abstract :
There are many requirements in document image analysis, which warrant understanding the equivalence of document images if possible without OCRing the text contents and in some cases OCRs do not exist. In this paper we propose to employ the entropy notion to´ feel´ the text content in a document image without actually reading it, and hence establish the equivalence or otherwise of two corresponding text components (line/word/character). We introduce Conventional Entropy Quantifier (CEQ) and also define Modified Entropy Quantifier (MEQ) to measure the energy content in the components. The results of experiments performed at line, word and character level are reported. These initial steps in the sequel are expected to establish the equivalence between the two text document images.
Keywords :
document image processing; entropy; optical character recognition; OCRing; document image analysis; energy content; entropy quantifiers; text contents; text document images; Energy measurement; Entropy; Image analysis; Image coding; Image generation; Image recognition; Optical character recognition software; Pixel; Position measurement; Shape;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Conference on Computational Intelligence and Multimedia Applications, 2007. International Conference on
Conference_Location :
Sivakasi, Tamil Nadu
Print_ISBN :
0-7695-3050-8
Type :
conf
DOI :
10.1109/ICCIMA.2007.304
Filename :
4426404
Link To Document :
بازگشت