• DocumentCode
    1685747
  • Title

    D3G: novel approaches to data statistics, understanding and preprocessing on the grid

  • Author

    Wöhrer, Alexander ; Nováková, Lenka ; Brezany, Peter ; Tjoa, A. Min

  • Author_Institution
    Inst. of Sci. Comput., Univ. of Vienna, Austria
  • Volume
    1
  • fYear
    2006
  • Abstract
    Relocating the code for data preprocessing (DPP) closer towards the data source is the overall task of the D3G framework (data statistics, data understanding, data preprocessing on the grid), developed within a joint project of the University of Vienna, the Vienna University of Technology and the Czech Technical University. This paper presents the data service side architecture to gather data statistics on-the-fly and use them in remote DPP methods on query results as well as an approach to gather exact continuous data statistics for whole tables in a database on the grid. The performance results of our prototype implementation are showing low running costs for the continuous data statistics inside the database and also the feasibility of our proposed data service side functionality.
  • Keywords
    data analysis; data mining; grid computing; query processing; relational databases; D3G framework; RDBMS; data mining; data preprocessing; data service side architecture; data statistics; data understanding; grid computing; query processing; remote DPP methods; Cost function; Cybernetics; Data mining; Data preprocessing; Databases; Interactive systems; Prototypes; Scientific computing; Statistical analysis; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications, 2006. AINA 2006. 20th International Conference on
  • ISSN
    1550-445X
  • Print_ISBN
    0-7695-2466-4
  • Type

    conf

  • DOI
    10.1109/AINA.2006.137
  • Filename
    1620210