Title :
Information Management System Using Structure Analysis of Paper/Electronic Documents and Its Applications
Author :
Seki, Minenobu ; Fujio, Masakazu ; Nagasaki, Takeshi ; Shinjo, Hiroshi ; Marukawa, Katsumi
Author_Institution :
Hitachi, Ltd., Tokyo
Abstract :
An information management system based on document structure analysis is presented. Its purpose is the simultaneous management of information held in various paper and electronic documents. The system comprises image document analysis, PDF document analysis, and HTML document analysis. Two applications are presented and the prototypes developed for them are described. One application is document summarization. The other is table understanding, which correlates data with items.
Keywords :
document image processing; text analysis; HTML document analysis; PDF document analysis; document summarization; image document analysis; information management system; paper-electronic document structure analysis; table understanding; HTML; Image analysis; Image converters; Information analysis; Information management; Laboratories; Prototypes; Text analysis; World Wide Web; XML
Conference_Title :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4377003