DocumentCode :
1815974
Title :
Information Valuation for Information Lifecycle Management
Author :
Chen, Ying
Author_Institution :
Dept. of Storage Syst., IBM Almaden Res. Center, San Jose, CA
fYear :
2005
fDate :
13-16 June 2005
Firstpage :
135
Lastpage :
146
Abstract :
The foremost crucial step towards a fully automated information lifecycle management (ILM) is to differentiate information by values in an unbiased manner and understand how values change over time. This paper presents an information valuation approach that quantifies the value of a given piece of information based on its usage over time. Our case study based on several real world NFS file server traces collected from Harvard University shows that such a model is simple, effective, and tangible since it relies on measurable metrics and observable facts. It captures the changing nature of the file value throughout their lifecycles, reflects the value differences among different files, and hence allows one to compare and classify files. More importantly, through additional analysis of the model outputs one can gain new insights into files, e.g., what files are most valuable and when. We show that files in different value classes exhibit different characteristics and can be characterized by unique sets of attributes. By devising algorithms to extract such attributes automatically for different classes of files, storage systems can predict what class a file would belong to early in its lifecycle, e.g., at the creation time. The file valuation, classification, and class membership prediction can then guide a wide range of new optimizations, e.g., data placement across tiered storage and data protection
Keywords :
classification; data structures; information management; middleware; storage management; Harvard University; NFS file server; data placement; data protection; file attribute extraction; file class membership; file classification; file lifecycle; file valuation; information differentiation; information lifecycle management; information valuation; storage systems; tiered storage; trace collection; Availability; Cost accounting; Data mining; Databases; File servers; Information management; Protection; Resource management; Storage automation; US Department of Defense;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Autonomic Computing, 2005. ICAC 2005. Proceedings. Second International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7965-2276-9
Type :
conf
DOI :
10.1109/ICAC.2005.35
Filename :
1498059
Link To Document :
بازگشت