• DocumentCode
    2002166
  • Title

    An OLAP data model driven approach to process statistical tables

  • Author

    Luk, Wo-Shun ; Leung, Philip

  • Author_Institution
    Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
  • fYear
    2005
  • fDate
    22-26 Aug. 2005
  • Firstpage
    1054
  • Lastpage
    1058
  • Abstract
    Statistical tables belong to an important subset of tables published in the Web, because they represent up-to-date, vital information sources for decision makers. These tables are often carefully designed for easy reading by analysts, and then mechanically produced by an OLAP database system. The general practice of extracting attribute-value pairs from statistical tables does not ensure high accuracy when they are used as a database for an information retrieval system. In this paper, we show how a human may visualize a statistical table as an multidimensional object, defined by a suitably modified OLAP model. In this way, the keywords are classified into semantically distinct groups, i.e., dimension hierarchies, without any ontological knowledge or resorting to machine learning. A prototype system which mimics the human reasoning for table processing has been implemented. Experiments on 150 randomly chosen tables from statistics Canada have confirmed the validity of this approach.
  • Keywords
    Internet; data mining; data models; query processing; Internet; OLAP database system; attribute-value pair; data model; information retrieval system; statistical table; Data mining; Data models; Database systems; Humans; Information retrieval; Machine learning; Multidimensional systems; Ontologies; Visual databases; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
  • ISSN
    1529-4188
  • Print_ISBN
    0-7695-2424-9
  • Type

    conf

  • DOI
    10.1109/DEXA.2005.144
  • Filename
    1508414