DocumentCode
1801314
Title
Scalability issues for high performance digital libraries on the World Wide Web
Author
Andresen, Daniel ; Yang, Tao ; Egecioglu, Omer ; Ibarra, Oscar H. ; Smith, Terence R.
Author_Institution
Dept. of Comput. Sci., California Univ., Santa Barbara, CA, USA
fYear
1996
fDate
13-15, May 1996
Firstpage
139
Lastpage
148
Abstract
We investigate scalability issues involved in developing high performance digital library systems. Our observations and solutions are based on our experience with the Alexandria Digital Library (ADL) testbed under development at UCSB. The current ADL system provides online browsing and processing of digitized maps and other geospatially mapped data via the World Wide Web (WWW). A primary activity of the ADL system involves computation and disk I/O for accessing compressed multi resolution images with hierarchical data structures, as well as other duties such as supporting database queries and on the fly HTML page generation. Providing multi resolution image browsing services can reduce network traffic but impose some additional cost at the server. We discuss the necessity of having a multiprocessor DL server to match potentially huge demands in simultaneous access requests from the Internet. We have developed a distributed scheduling system for processing DL requests, which actively monitors the usages of CPU, I/O channels and the interconnection network to effectively distribute work across processing units to exploit task and I/O parallelism. We present an experimental study on the performance of our scheme in addressing the scalability issues arising in ADL wavelet processing and file retrieval. Our results indicate that the system delivers good performance on these types of tasks
Keywords
Internet; geophysics; information retrieval; library automation; scheduling; special libraries; visual databases; ADL wavelet processing; Alexandria Digital Library; I/O parallelism; WWW; World Wide Web; compressed multi resolution images; database queries; digitized maps; distributed scheduling system; file retrieval; geospatially mapped data; hierarchical data structures; high performance digital libraries; interconnection network; multi resolution image browsing services; multiprocessor DL server; network traffic; on the fly HTML page generation; online browsing; scalability issues; Data structures; Image coding; Image resolution; Network servers; Scalability; Software libraries; Testing; Web server; Web sites; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Libraries, 1996. ADL '96., Proceedings of the Third Forum on Research and Technology Advances in
Conference_Location
Washington, DC
Print_ISBN
0-8186-7403-2
Type
conf
DOI
10.1109/ADL.1996.502524
Filename
502524
Link To Document