DocumentCode :
1917018
Title :
A Data Management System for Pre-docking in Large-Scale Virtual Screening
Author :
Chen, Jiuqiang ; Zhang, Ruisheng ; Chen, ShiLin ; Li, Lifen ; Zhang, Ying ; Yuan, Chengda ; Li, Lian
Author_Institution :
Eng. Res. Center of Open Source Software & Real-time Syst., Minist. of Educ., Lanzhou Univ., Lanzhou, China
fYear :
2010
fDate :
16-18 July 2010
Firstpage :
109
Lastpage :
114
Abstract :
Virtual screening is a new approach attracting increasing levels of interest in the pharmaceutical industry, as a productive and cost-effective technology in the search for novel lead compounds. The preparation of millions of small molecular compounds is the prerequisite for large-scale virtual screening, and these massive data are usually provided with different format. In addition, scientists often need to select some of them that meet certain conditions. Therefore, an efficient data management approach is playing an important role in virtual screening process for managing large-scale small molecular compounds. In this paper, we represent a comprehensive data management framework for pre-docking in large-scale virtual screening. In this framework, we construct a distributed chemical database and utilize parallel processing approach to search certain molecules from the database on the scale of at least several million. We also develop a proxy schema, which is responsible to perform the basic function (such as, splitting large-scale data, update, insert and so on) a collection of multiple, logically interrelated databases distributed over a computer network, meanwhile, we design and establish a rule of splitting large-scale data with optimization. Finally, we simulate and demonstrate a stress test of constructing and searching database. It turns out that our proposal could make the preparing phase of virtual screening process more simple and efficient.
Keywords :
chemical engineering computing; distributed databases; parallel processing; pharmaceuticals; computer network; cost effective technology; data management system; distributed chemical database; large scale data; large scale small molecular compound; large scale virtual screening; optimization; parallel processing approach; pharmaceutical industry; Chemicals; Compounds; Distributed databases; Drugs; Filtering; Relational databases; Chemical Database; Data Management System; Parallel Computing; Virtual Screening;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
ChinaGrid Conference (ChinaGrid), 2010 Fifth Annual
Conference_Location :
Guangzhou
Print_ISBN :
978-1-4244-7543-8
Electronic_ISBN :
978-1-4244-7544-5
Type :
conf
DOI :
10.1109/ChinaGrid.2010.40
Filename :
5563017
Link To Document :
بازگشت