Title :
Frequent graph mining and its application to molecular databases
Author :
Nijssen, Siegfried ; Kok, Joost N.
Author_Institution :
LIACS, Universiteit Leiden, Netherlands
Abstract :
Molecular fragment mining is a promising approach for discovering novel fragments for drugs. We investigate a method for mining fragments which consists of three phases: first, a preprocessing phase for turning molecular databases into graph databases; second, the Gaston frequent graph mining phase for mining frequent paths, free trees and cyclic graphs; and third, a postprocessing phase in which redundant frequent fragments are removed. We devote most of our attention to the frequent graph mining phase, as this phase is computationally the most demanding, but also look at the other phases.
Keywords :
biochemistry; biology computing; computational complexity; data mining; database management systems; graph theory; Gaston frequent graph mining phase; cyclic graph mining; free tree mining; frequent graph mining; graph databases; molecular databases; molecular fragment mining; preprocessing phase; Algorithm design and analysis; Data analysis; Data mining; Deductive databases; Drugs; Encoding; Libraries; Polynomials; Spatial databases; Tree graphs;
Conference_Titel :
Systems, Man and Cybernetics, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8566-7
DOI :
10.1109/ICSMC.2004.1401252