Title :
Qserv: A distributed shared-nothing database for the LSST catalog
Author :
Wang, Daniel L. ; Monkewitz, Serge M. ; Lim, Kian-Tat ; Becla, Jacek
Author_Institution :
SLAC Nat. Accel. Lab., Menlo Park, CA, USA
Abstract :
The LSST project will provide public access to a database catalog that, in its final year, is estimated to include 26 billion stars and galaxies in dozens of trillion detections in multiple petabytes. Because we are not aware of an existing open-source database implementation that has been demonstrated to efficiently satisfy astronomers' spatial self-joining and cross-matching queries at this scale, we have implemented Qserv, a distributed shared-nothing SQL database query system. To speed development, Qserv relies on two successful open-source software packages: the MySQL RDBMS and the Xrootd distributed file system. We describe Qserv's design, architecture, and ability to scale to LSST's data requirements. We illustrate its potential with test results on a 150-node cluster using 55 billion rows and 30 terabytes of simulated data. These results demonstrate the soundness of Qserv's approach and the scale it achieves on today's hardware.
Keywords :
SQL; astronomy computing; public domain software; query processing; relational databases; scientific information systems; LSST catalog; MySQL RDBMS; Qserv; Xrootd distributed file system; astronomers; database catalog; distributed shared nothing SQL database query system; galaxies; open source database implementation; public access; stars; Bandwidth; Catalogs; Distributed databases; Hardware; Indexing; Servers; MPP; database; distributed; file system; parallel; shared-nothing;
Conference_Title :
High Performance Computing, Networking, Storage and Analysis (SC), 2011 International Conference for
Conference_Location :
Seattle, WA
Electronic_ISBN :
978-1-4503-0771-0