DocumentCode
2196975
Title
SQMD: Architecture for Scalable, Distributed Database System Built on Virtual Private Servers
Author
Kim, Kangseok ; Pierce, Marlon E. ; Guha, Rajarshi
Author_Institution
Community Grids Lab., Indiana Univ., Bloomington, IN
fYear
2008
fDate
7-12 Dec. 2008
Firstpage
658
Lastpage
665
Abstract
Many scientific fields routinely generate huge datasets. In many cases, these datasets are not static but grow rapidly in size. Handling these types of datasets, as well as allowing sophisticated queries, necessitates scalable distributed database systems in which scientists can search the datasets efficiently. In this paper we present the architecture, implementation, and performance analysis of a scalable, distributed database system built on software-based virtualization environments. The system architecture makes use of a software partitioning of the database based on data clustering, the SQMD (single query multiple database) mechanism, a Web service interface, and virtualization software technologies. The system allows uniform access to concurrently distributed databases, using the SQMD mechanism based on the publish/subscribe paradigm. We highlight the scalability of our architecture by applying it to a database of 17 million chemical structures. In addition to simple identifier-based retrieval, we present performance results for shape similarity queries, which are extremely time-intensive with traditional architectures.
Keywords
data visualisation; distributed databases; pattern clustering; query processing; software architecture; user interfaces; virtual private networks; SQMD architecture; Web service interface; data clustering; identifier based retrieval; publish-subscribe paradigm; scalable distributed database system; single query multiple database mechanism; software partitioning; virtual private servers; virtualization software technologies; Chemical technology; Computer architecture; Database systems; Distributed databases; Performance analysis; Scalability; Service oriented architecture; Software performance; Software systems; Web services; data clustering; distributed database system; virtualization; web service;
fLanguage
English
Publisher
IEEE
Conference_Titel
eScience, 2008. eScience '08. IEEE Fourth International Conference on
Conference_Location
Indianapolis, IN
Print_ISBN
978-1-4244-3380-3
Electronic_ISBN
978-0-7695-3535-7
Type
conf
DOI
10.1109/eScience.2008.35
Filename
4736881
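Note
The abstract describes SQMD (single query multiple database) as a mechanism in which one query reaches several partitioned databases through a publish/subscribe layer. The following is a minimal, single-process sketch of that idea only, not the authors' implementation. Every name in it (`Broker`, `DatabaseAgent`, `QueryAggregator`, `build_system`, the `compound` table, the topic names) is hypothetical, the broker is a synchronous in-memory stand-in for a real messaging system, and modulo partitioning on the identifier is a stand-in for the data-clustering partitioning mentioned in the abstract.

```python
import sqlite3
from collections import defaultdict
from typing import Callable, Dict, List, Sequence, Tuple

Row = Tuple[int, str]


class Broker:
    """Tiny in-process stand-in for a publish/subscribe broker (assumption)."""

    def __init__(self) -> None:
        self._subscribers: Dict[str, List[Callable]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Callable) -> None:
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, message) -> None:
        # Deliver the message synchronously to every subscriber of the topic.
        for handler in list(self._subscribers[topic]):
            handler(message)


class DatabaseAgent:
    """Owns one partition and answers every query published on 'query'."""

    def __init__(self, broker: Broker, rows: Sequence[Row]) -> None:
        self._broker = broker
        self._db = sqlite3.connect(":memory:")
        self._db.execute(
            "CREATE TABLE compound (cid INTEGER PRIMARY KEY, name TEXT)"
        )
        self._db.executemany("INSERT INTO compound VALUES (?, ?)", rows)
        broker.subscribe("query", self._on_query)

    def _on_query(self, message) -> None:
        sql, params = message
        rows = self._db.execute(sql, params).fetchall()
        # Publish this partition's partial result for the aggregator.
        self._broker.publish("result", rows)


class QueryAggregator:
    """Publishes a single query and merges the partial results."""

    def __init__(self, broker: Broker) -> None:
        self._broker = broker
        self._results: List[Row] = []
        broker.subscribe("result", self._results.extend)

    def run(self, sql: str, params: Sequence = ()) -> List[Row]:
        self._results.clear()
        self._broker.publish("query", (sql, tuple(params)))
        return sorted(self._results)


def build_system(rows: Sequence[Row], n_partitions: int):
    """Split rows by cid modulo n_partitions and start one agent per part."""
    broker = Broker()
    partitions: List[List[Row]] = [[] for _ in range(n_partitions)]
    for row in rows:
        partitions[row[0] % n_partitions].append(row)
    agents = [DatabaseAgent(broker, part) for part in partitions]
    return QueryAggregator(broker), agents


if __name__ == "__main__":
    data = [(1, "benzene"), (2, "toluene"), (3, "phenol"), (4, "aniline")]
    aggregator, _agents = build_system(data, n_partitions=3)
    print(aggregator.run("SELECT cid, name FROM compound WHERE cid = ?", (4,)))
```

The caller issues one query and receives one merged result without knowing how many databases answered. In a distributed deployment the broker would deliver messages asynchronously over the network, so the aggregator would also need a way to tell when all partitions have replied; this sketch sidesteps that by dispatching synchronously.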