• DocumentCode
    249351
  • Title

    Data Base Analysis Using a Compact Data Set

  • Author

    Kuri-Morales, Angel Fernando

  • Author_Institution
    Dept. de Comput., Inst. Tecnol. Autonomo de Mexico, Mexico City, Mexico
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    227
  • Lastpage
    233
  • Abstract
    The exploitation of large data bases frequently implies the investment of large and, usually, expensive resources both in terms of the storage and processing time required. It is possible to obtain equivalent reduced data sets where the statistical information of the original data may be preserved while dispensing with redundant constituents. Therefore, the physical embodiment of the relevant features of the data base is more economical. We propose a method where we may obtain an optimal transformed representation of the original data which is, in general, considerably more compact than the original without impairing its informational content.
  • Keywords
    data reduction; data structures; database management systems; set theory; storage management; compact data set; informational content; large data base analysis; optimal transformed data representation; reduced data sets; redundant constituents; statistical information; Clustering algorithms; Companies; Data mining; Databases; Entropy; Mathematical model; Statistical analysis; compaction; data bases; statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (BigData Congress), 2014 IEEE International Congress on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5056-0
  • Type

    conf

  • DOI
    10.1109/BigData.Congress.2014.41
  • Filename
    6906783