DocumentCode
258964
Title
Entropy Computations of Document Images in Run-Length Compressed Domain
Author
Nagabhushan, P. ; Javed, Muhammad ; Chaudhuri, Bidyut B.
Author_Institution
Dept. of Studies in Comput. Sci., Univ. of Mysore, Mysore, India
fYear
2014
fDate
8-10 Jan. 2014
Firstpage
287
Lastpage
291
Abstract
Compression of documents, images, audios and videos have been traditionally practiced to increase the efficiency of data storage and transfer. However, in order to process or carry out any analytical computations, decompression has become an unavoidable pre-requisite. In this research work, we have attempted to compute the entropy, which is an important document analytic directly from the compressed documents. We use Conventional Entropy Quantifier (CEQ) and Spatial Entropy Quantifiers (SEQ) for entropy computations [1]. The entropies obtained are useful in applications like establishing equivalence, word spotting and document retrieval. Experiments have been performed with all the data sets of [1], at character, word and line levels taking compressed documents in run-length compressed domain. The algorithms developed are computational and space efficient, and results obtained match 100% with the results reported in [1].
Keywords
computational complexity; data compression; document image processing; entropy; CEQ; SEQ; compressed documents; computational efficient algorithms; conventional entropy quantifier; data storage; data transfer; document analytics; document image entropy computations; document retrieval; run-length compressed domain; space efficient algorithms; spatial entropy quantifiers; word spotting; Data mining; Energy measurement; Entropy; Feature extraction; Image coding; Position measurement; Entropy; compressed documents; compressed domain processing; run-length compression;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal and Image Processing (ICSIP), 2014 Fifth International Conference on
Conference_Location
Jeju Island
Type
conf
DOI
10.1109/ICSIP.2014.51
Filename
6754890
Link To Document