DocumentCode :
3142413
Title :
Preattentive reading and selective attention for document image analysis
Author :
Faure, Claudie
Author_Institution :
ENST-TSI, CNRS, Paris, France
fYear :
1999
fDate :
20-22 Sep 1999
Firstpage :
577
Lastpage :
580
Abstract :
PixED (from Pixel to Electronic Document) is aimed at converting document images into structured electronic documents which can be read by a machine for information retrieval. The approach is based on the combination of perception and symbol reading which are the two processes involved when humans detect the organisation of a document. “Preattentive reading” denotes the physical segmentation related to perceptual organisation. “Selective attention” means that symbol reading is limited to specific sequences of symbols or to pre-attentively selected locations. An OCR provides the primary structured description of the document. PixED improves the quality of this description, completes the physical segmentation and adds a logical description. A distributed software architecture and an incremental strategy are defined to enable the integration of perception and symbol reading. The approach is tested on a set of documents composed of several pages which are gathered from proceedings of scientific conferences
Keywords :
distributed processing; document image processing; image segmentation; information retrieval; optical character recognition; software architecture; text analysis; OCR; PixED; distributed software architecture; document image analysis; document image conversion; document organisation detection; information retrieval; logical description; perception; physical segmentation; pre-attentively selected locations; preattentive reading; scientific conferences; selective attention; structured description; structured electronic documents; symbol reading; symbol sequences; Humans; Image analysis; Image segmentation; Image sequence analysis; Information retrieval; Optical character recognition software; Pixel; Software architecture; Text analysis; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
Type :
conf
DOI :
10.1109/ICDAR.1999.791853
Filename :
791853
Link To Document :
بازگشت