Title :
Chinese document image retrieval system based on proportion of black pixel area in a character image
Author :
Ching-Lin Wang ; Cher, T. ; Yung-Kuan Chan ; Ren-Hung Hwang ; Wan-Wen Huang
Author_Institution :
NationaI Chung Cheng University
Abstract :
In order to preserve the original state of a document, a document is usually saved in computer in image format as backup data after a scanner scans it. Presently, many retrieval systems used to deal with this sort of duplicate document images have been proposed, but most of them are only suitable for English duplicate document images. This paper proposes a system for Chinese duplicate document images, which uses the proportion of black pixel area in each character image as the feature of this character image. According to experimental results, the proposed system can efficiently find out the desired duplicate document image.
Keywords :
Character recognition; Computer science; Image databases; Image retrieval; Image segmentation; Information management; Information retrieval; Optical character recognition software; Pixel; Spatial databases; Dynamic Programming; LCS (Longest Common Subsequence); OCR (optical character recognition); character segmentation; document image; image matching;
Conference_Titel :
Advanced Communication Technology, 2004. The 6th International Conference on
Conference_Location :
Phoenix Park, Korea
Print_ISBN :
89-5519-119-7
DOI :
10.1109/ICACT.2004.1292823