Title :
Advanced pruning strategies to speed up mining closed molecular fragments
Author :
Borgelt, Christian ; Meinl, Thorsten ; Berthold, Michael R.
Author_Institution :
Sch. of Comput. Sci., Otto von Guericke Univ., Magdeburg, Germany
Abstract :
In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructures or molecular fragments are desired. We describe two advanced pruning strategies - equivalent sibling pruning and perfect extension pruning - that can be used to speed up the MoFa algorithm (introduced in C. Borgelt and M.R. Berthold, (2002)) in the search for closed molecular fragments, as we demonstrate with experiments on the NCI´s HIV database.
Keywords :
biochemistry; data mining; visual databases; MoFa algorithm; advanced pruning strategies; closed molecular fragment mining; equivalent sibling pruning; frequent subgraphs mining; graph databases; perfect extension pruning; Biochemistry; Computer science; Data mining; Drugs; Ear; Human immunodeficiency virus; Hydrogen; Information science; Protection; Tree graphs;
Conference_Titel :
Systems, Man and Cybernetics, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8566-7
DOI :
10.1109/ICSMC.2004.1401251