DocumentCode
3019398
Title
Financial document image coding with regions of interest using JPEG2000
Author
Yin, Xu-Cheng ; Liu, Chang-ping ; Han, Zhi
fYear
2005
fDate
29 Aug.-1 Sept. 2005
Firstpage
96
Abstract
Document image coding is a very important issue in document analysis and recognition systems provided with vast samples. An image compression algorithm with regions of interest (ROIs) using JPEG2000 is proposed for financial document images which have various categories, complex layouts, and irregular noises. Three types of ROIs: filled information ROIs, seal ROIs, and handwriting ROIs, are detected and extracted through document knowledge analysis and handwriting identification. The first ROIs are detected by document classification, the second are extracted by connected component analysis based on color and shape information, and the third are located by handwriting identification using an incremental Fisher linear discriminant classifier. A ROI mask with a random shape is constructed by thresholding and merging these ROIs. Finally, a financial document image is encoded using JPEG2000 Part I with this ROI mask. Compared to JPEG and DjVu, the method improves visual quality while decreasing storing space.
Keywords
data compression; document image processing; financial data processing; handwriting recognition; image classification; image coding; image colour analysis; JPEG2000; connected component analysis; document analysis systems; document classification; document knowledge analysis; document recognition systems; financial document image coding; handwriting identification; image color; image compression algorithm; image shape; incremental Fisher linear discriminant classifier; regions of interest; visual quality; Data mining; Image analysis; Image coding; Image recognition; Information analysis; Noise shaping; Seals; Shape; Text analysis; Transform coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN
1520-5263
Print_ISBN
0-7695-2420-6
Type
conf
DOI
10.1109/ICDAR.2005.113
Filename
1575517
Link To Document