DocumentCode :
3058105
Title :
Image and document processing techniques for the RightPages electronic library system
Author :
O'Gorman, Lawrence
Author_Institution :
AT&T Bell Lab., Murray Hill, NJ, USA
fYear :
1992
fDate :
30 Aug-3 Sep 1992
Firstpage :
260
Lastpage :
263
Abstract :
Describes some of the document processing techniques used in the RightPages electronic library system. Since the system deals with scanned images of document pages, these techniques are critical to the use and appearance of the system. The author describes three techniques: (1) noise reduction for binary document pages, to improve page appearance and subsequent optical character recognition and compression; (2) subsampling of the text image to fit on the computer screen while maintaining readability; and (3) document layout analysis to determine text blocks.
Keywords :
data compression; document handling; document image processing; full-text databases; image processing; optical character recognition; RightPages; document layout analysis; document processing; electronic library system; noise reduction; readability; scanned image processing; text blocks; text image sampling; Character recognition; Filters; Image analysis; Image databases; Image storage; Libraries; Noise reduction; Optical character recognition software; Optical noise; Text analysis
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2915-0
Type :
conf
DOI :
10.1109/ICPR.1992.201768
Filename :
201768