Title :
Deduction of metadata for tabular structures in plain text
Author :
Hodge, I.E. ; Gray, W.A. ; Fiddian, N.J.
Author_Institution :
Dept. of Comput. Sci., Wales Univ., Cardiff, UK
Abstract :
Tables are a very effective way of presenting information: because of their flexibility and ease of reading, they have long been used to present information clearly and concisely. With the introduction of the Internet, large bodies of unstructured text have become available, many of which contain information in tabular form. There is a growing awareness of this information and of the need to link it with more conventionally held information in other applications so that its full potential can be realised. The problem with tabular information is that, although its presentation makes it easy for a reader to understand, a computer application cannot access and use it automatically, and so cannot link it with other information. Our aim is to develop a set of tools that will allow us to locate, extract and process tables from plain text documents in order to reuse the potentially valuable information they contain.
Keywords :
content-based retrieval; Internet; metadata deduction; plain text; plain text documents; tabular information; tabular structures
Conference_Title :
Multimedia Databases and MPEG-7 (Ref. No. 1999/056), IEE Colloquium on
Conference_Location :
London
DOI :
10.1049/ic:19990308