Title :
Hybrid fragment mining with MoFa and FSG
Author :
Meinl, Thorsten ; Berthold, Michael R.
Author_Institution :
Dept. of Comput. Sci., Erlangen-Nuremberg Univ., Erlangen, Germany
Abstract :
In the past few years a number of different subgraph mining algorithms have been proposed. They are often used for ending frequent fragments in molecular databases. All these algorithms behave quite well when used on small datasets of not more than a few thousand molecules. However they all fail on larger amounts of data because they are either time consuming or have enormous memory requirements. We present a hybrid mining technique that overcomes the individual problems of the underlying algorithms and outperforms the individual methods impressively on large databases.
Keywords :
biochemistry; biology computing; data mining; very large databases; FSG; MoFa; hybrid fragment mining; molecular databases; subgraph mining algorithms; Biochemistry; Computer science; Data mining; Databases; Drugs; High temperature superconductors; Information science; Libraries; Logic; Testing;
Conference_Titel :
Systems, Man and Cybernetics, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8566-7
DOI :
10.1109/ICSMC.2004.1401250