• DocumentCode
    2410760
  • Title

    Some analyses of interval data

  • Author

    Billard, Lynne

  • Author_Institution
    Dept. of Stat., Univ. of Georgia, Athens, GA
  • fYear
    2008
  • fDate
    23-26 June 2008
  • Firstpage
    3
  • Lastpage
    11
  • Abstract
    Contemporary computers bring us very large datasets, datasets which can be too large for those same computers to analyse properly. One approach is to aggregate these data (by some suitably scientific criteria) to provide more manageably-sized datasets. These aggregated data will perforce be symbolic data consisting of lists, intervals, histograms, etc. Now an observation is a p-dimensional hypercube or Cartesian product of p distributions in Rp, instead of the p-dimensional point in in Rp of classical data. Other data can be naturally symbolic. We give a brief overview of interval-valued data and show briefly that it is important to use symbolic analysis methodology since, e.g., analyses based on classical surrogates ignore some of the information in the dataset.
  • Keywords
    data analysis; statistical distributions; symbol manipulation; very large databases; Cartesian product; data aggregation; interval-valued data analysis; p distributions; p-dimensional hypercube; symbolic data analysis methodology; very large datasets; Aggregates; Data analysis; Data mining; Histograms; History; Hospitals; Hypercubes; Information analysis; Random variables; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology Interfaces, 2008. ITI 2008. 30th International Conference on
  • Conference_Location
    Dubrovnik
  • ISSN
    1330-1012
  • Print_ISBN
    978-953-7138-12-7
  • Electronic_ISBN
    1330-1012
  • Type

    conf

  • DOI
    10.1109/ITI.2008.4588377
  • Filename
    4588377