DocumentCode :
3246665
Title :
An approach to intelligent information filtering in Chinese document images based on garbage model
Author :
Jiewei, Chen ; Weiran, Xu ; Jun, Guo
Author_Institution :
Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun., China
fYear :
2004
fDate :
20-22 Oct. 2004
Firstpage :
198
Lastpage :
201
Abstract :
A fast approach to Chinese document image filtering is presented. Garbage models are built by keyword clustering prior to keyword searching. The retrieval process is accelerated by the Boyer-Moore algorithm. A character is classified as accepted or rejected by the distance from the garbage models. A confidence measure ensures precision. Document vectors are built, based on keyword spotting from the document image. We obtain the score of the document image by means of a vector space model. Experimental results confirmed the robustness of the proposed approach over a wide range of degradations.
Keywords :
document image processing; feature extraction; image retrieval; image segmentation; information filtering; optical character recognition; Boyer-Moore algorithm; Chinese document image filtering; OCR; character garbage model distance; confidence measure; feature extraction; image retrieval process; image segmentation; intelligent information filtering; keyword spotting; vector space model; Acceleration; Character recognition; Image recognition; Image segmentation; Information filtering; Information filters; Information retrieval; Keyword search; Optical character recognition software; Tiles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on
Print_ISBN :
0-7803-8687-6
Type :
conf
DOI :
10.1109/ISIMP.2004.1434034
Filename :
1434034
Link To Document :
بازگشت