DocumentCode
659607
Title
KUChemBio: A repository of computational chemical biology data sets
Author
Hall, Aaron Smalter ; Jun Huan
Author_Institution
Mol. Graphics & Modeling Lab., Univ. of Kansas, Lawrence, KS, USA
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
37
Lastpage
42
Abstract
Data set curation in cheminformatics is largely ignored, and many publications do not provide the specific chemical structures used in their experiments. Access to chemical structures is vital for experiment reproducibility and comparison of competing methods. To address this limitation, the KU Chemical Biology Database (KUChemBio) has established a collection of 69 data sets for computational chemical biology experiments. Data sets fall into several categories including ADME, toxicity, binding affinity, solubility, melting points, and others. Chemical structures in SDF or Smiles format are provided along with binary or real valued activity labels. Data sets have been consolidated from other online repositories and content from recent publications has been added as well. KUChemBio is located at http://bcf.ku.edu/kuchembio.
Keywords
chemistry computing; data handling; ADME category; KU chemical biology database; KUChemBio; SDF format; Smiles format; binding affinity category; chemical structures; cheminformatics; computational chemical biology data sets; data set curation; experiment reproducibility; melting points category; solubility category; toxicity category; Absorption; Biology; Chemicals; Compounds; Computational modeling; Drugs; Inhibitors;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691756
Filename
6691756
Link To Document