Title :
Data Base Analysis Using a Compact Data Set
Author :
Kuri-Morales, Angel Fernando
Author_Institution :
Dept. de Comput., Inst. Tecnol. Autonomo de Mexico, Mexico City, Mexico
fDate :
June 27 2014-July 2 2014
Abstract :
The exploitation of large data bases frequently implies the investment of large and, usually, expensive resources both in terms of the storage and processing time required. It is possible to obtain equivalent reduced data sets where the statistical information of the original data may be preserved while dispensing with redundant constituents. Therefore, the physical embodiment of the relevant features of the data base is more economical. We propose a method where we may obtain an optimal transformed representation of the original data which is, in general, considerably more compact than the original without impairing its informational content.
Keywords :
data reduction; data structures; database management systems; set theory; storage management; compact data set; informational content; large data base analysis; optimal transformed data representation; reduced data sets; redundant constituents; statistical information; Clustering algorithms; Companies; Data mining; Databases; Entropy; Mathematical model; Statistical analysis; compaction; data bases; statistics;
Conference_Titel :
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5056-0
DOI :
10.1109/BigData.Congress.2014.41