DocumentCode :
2092901
Title :
Globally Distributed BookPrep - Open Cirrus-Hosted Service for Book Preparation
Author :
Reddy, Prakash ; Dudekula, Shariff ; Puthanveedu, Susanth ; Milojicic, Dejan
Author_Institution :
HP Imaging & Printing, Palo Alto, CA, USA
fYear :
2011
fDate :
12-13 Oct. 2011
Firstpage :
11
Lastpage :
16
Abstract :
BookPrep is a Print-On-Demand service that takes raw scans and converts them to print-ready files. It requires a large amount of storage and takes an average of 5 hours of CPU time to process a single book of about 300 pages. The experiment we conducted involved moving the processing of books on Open Cirrus closer to the location of the data. At three Open Cirrus sites we installed the BookPrep service and pre-populated each site with region-specific scanned books. When requests come in to process a book, each request is routed to the compute node closest to the source data. The compute node is then expected to store the processed data on the same network. The compute nodes are allocated and deallocated based on demand. A cloud-based metadata repository is used to update the metadata associated with each book regardless of the location of the source and derived data. The goal of this experiment is to determine whether performance can be improved by moving book processing closer to the source data location. The fundamental reason behind the success of MapReduce is the notion of moving compute close to data, and we would like to see whether that same principle can be applied to a pull-based scheduling model.
Keywords :
Web services; cloud computing; electronic publishing; scheduling; MapReduce; book preparation; book processing; cloud based metadata repository; compute node allocation; compute node deallocation; data location; globally distributed BookPrep; open cirrus sites; open cirrus-hosted service; print-on-demand service; print-ready files; pull based scheduling model; region-specific scanned books; source data location; Bandwidth; Clouds; Computer architecture; Pipelines; Portals; Printing; Scalability; Imaging and printing; distribution
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Open Cirrus Summit (OCS), 2011 Sixth
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4673-0727-7
Type :
conf
DOI :
10.1109/OCS.2011.8
Filename :
6200547