DocumentCode
3248656
Title
Mining molecular fragments: finding relevant substructures of molecules
Author
Borgelt, Christian ; Berthold, Michael R.
Author_Institution
Sch. of Comput. Sci., Univ. of Magdeburg, Germany
fYear
2002
fDate
2002
Firstpage
51
Lastpage
58
Abstract
We present an algorithm to find fragments in a set of molecules that help to discriminate between different classes of for instance, activity in a drug discovery context. Instead of carrying out a brute-force search, our method generates fragments by embedding them in all appropriate molecules in parallel and prunes the search tree based on a local order of the atoms and bonds, which results in substantially faster search by eliminating the need for frequent, computationally expensive reembeddings and by suppressing redundant search. We prove the usefulness of our algorithm by demonstrating the discovery of activity-related groups of chemical compounds in the well-known National Cancer Institute´s HIV-screening dataset.
Keywords
biochemistry; biology computing; chemical structure; data mining; molecular biophysics; pharmaceutical industry; association rule mining; bioinformatics; data mining; drug discovery; molecules; search strategy; Association rules; Atomic measurements; Bioinformatics; Cancer; Chemical compounds; Computer science; Data analysis; Data mining; Drugs; Electronic mail;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7695-1754-4
Type
conf
DOI
10.1109/ICDM.2002.1183885
Filename
1183885
Link To Document