DocumentCode
2002166
Title
An OLAP data model driven approach to process statistical tables
Author
Luk, Wo-Shun ; Leung, Philip
Author_Institution
Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
fYear
2005
fDate
22-26 Aug. 2005
Firstpage
1054
Lastpage
1058
Abstract
Statistical tables belong to an important subset of tables published in the Web, because they represent up-to-date, vital information sources for decision makers. These tables are often carefully designed for easy reading by analysts, and then mechanically produced by an OLAP database system. The general practice of extracting attribute-value pairs from statistical tables does not ensure high accuracy when they are used as a database for an information retrieval system. In this paper, we show how a human may visualize a statistical table as an multidimensional object, defined by a suitably modified OLAP model. In this way, the keywords are classified into semantically distinct groups, i.e., dimension hierarchies, without any ontological knowledge or resorting to machine learning. A prototype system which mimics the human reasoning for table processing has been implemented. Experiments on 150 randomly chosen tables from statistics Canada have confirmed the validity of this approach.
Keywords
Internet; data mining; data models; query processing; Internet; OLAP database system; attribute-value pair; data model; information retrieval system; statistical table; Data mining; Data models; Database systems; Humans; Information retrieval; Machine learning; Multidimensional systems; Ontologies; Visual databases; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
ISSN
1529-4188
Print_ISBN
0-7695-2424-9
Type
conf
DOI
10.1109/DEXA.2005.144
Filename
1508414
Link To Document