DocumentCode :
2877506
Title :
Enhancing and optimizing a data protection solution
Author :
Cherkasova, Ludmila ; Lau, Roger ; Burose, Harald ; Kappler, Bernhard
Author_Institution :
Hewlett-Packard Labs., Palo Alto, CA, USA
fYear :
2009
fDate :
21-23 Sept. 2009
Firstpage :
1
Lastpage :
10
Abstract :
Analyzing and managing large amounts of unstructured information is a high priority task for many companies. For implementing content management solutions, companies need a comprehensive view of their unstructured data. In order to provide a new level of intelligence and control over data resident within the enterprise, one needs to build a chain of tools and automated processes that enable the evaluation, analysis, and visibility into information assets and their dynamics during the information life-cycle. We propose a novel framework to utilize the existing backup infrastructure by integrating additional content analysis routines and extracting already available filesystem metadata over time. This is used to perform data analysis and trending to add performance optimization and self-management capabilities to backup and information management tasks. Backup management faces serious challenges on its own: processing ever increasing amount of data while meeting the timing constraints of backup windows could require adaptive changes in backup scheduling routines. We revisit a traditional backup job scheduling and demonstrate that random job scheduling may lead to inefficient backup processing and an increased backup time. In this work, we use a historic information about the object backup processing time and suggest an additional job scheduling, and automated parameter tuning which may significantly optimize the overall backup time. Under this scheduling, called LBF, the longest backups (the objects with longest backup time) are scheduled first. We evaluate the performance benefits of the introduced scheduling using a realistic workload collected from the seven backup servers at HP Labs. Significant reduction of the backup time (up to 30%) and improved quality of service can be achieved under the proposed job assignment policy.
Keywords :
business data processing; content management; scheduling; security of data; automated parameter tuning; automated processes; backup infrastructure; backup job scheduling; backup management; backup scheduling routines; backup servers; backup windows; content analysis routines; content management solution; data analysis; data protection solution; filesystem metadata; information life-cycle; information management task; job assignment policy; longest backup time; object backup processing time; performance optimization; quality of service; self-management capability; unstructured data; Automatic control; Companies; Content management; Data analysis; Data mining; Information analysis; Information management; Intelligent control; Optimization; Protection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems, 2009. MASCOTS '09. IEEE International Symposium on
Conference_Location :
London
ISSN :
1526-7539
Print_ISBN :
978-1-4244-4927-9
Electronic_ISBN :
1526-7539
Type :
conf
DOI :
10.1109/MASCOT.2009.5367043
Filename :
5367043
Link To Document :
بازگشت