DocumentCode
249351
Title
Data Base Analysis Using a Compact Data Set
Author
Kuri-Morales, Angel Fernando
Author_Institution
Dept. de Comput., Inst. Tecnol. Autonomo de Mexico, Mexico City, Mexico
fYear
2014
fDate
June 27 2014-July 2 2014
Firstpage
227
Lastpage
233
Abstract
Exploiting large data bases frequently requires large and usually expensive resources, in terms of both storage and processing time. It is possible to obtain equivalent reduced data sets that preserve the statistical information of the original data while dispensing with redundant constituents, so that the physical embodiment of the relevant features of the data base is more economical. We propose a method that yields an optimal transformed representation of the original data which is, in general, considerably more compact than the original without impairing its informational content.
Keywords
data reduction; data structures; database management systems; set theory; storage management; compact data set; informational content; large data base analysis; optimal transformed data representation; reduced data sets; redundant constituents; statistical information; Clustering algorithms; Companies; Data mining; Databases; Entropy; Mathematical model; Statistical analysis; compaction; data bases; statistics
fLanguage
English
Publisher
IEEE
Conference_Titel
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location
Anchorage, AK
Print_ISBN
978-1-4799-5056-0
Type
conf
DOI
10.1109/BigData.Congress.2014.41
Filename
6906783