DocumentCode :
3070431
Title :
Mining Molecular Datasets on Symmetric Multiprocessor Systems
Author :
Meinl, Thorsten ; Worlein, Marc ; Fischer, Ingrid ; Philippsen, Michael
Author_Institution :
Konstanz Univ., Konstanz
Volume :
2
fYear :
2006
fDate :
8-11 Oct. 2006
Firstpage :
1269
Lastpage :
1274
Abstract :
Although in the last few years about a dozen sophisticated algorithms for mining frequent fragments in molecular databases have been proposed, searching big databases with 100,000 compounds and more is still a time-consuming process. Even the currently fastest algorithms like gSpan, FFSM, Gaston, or MoFa require hours to complete their tasks. This paper presents thread-based parallel versions of MoFa [5] and gSpan [26] that achieve speedups up to 11 on a shared-memory SMP system using 12 processors. We discuss the design space of the parallelization, the results, and the obstacles that are caused by the irregular search space and by the current state of Java technology.
Keywords :
Java; biology computing; data mining; multiprocessing systems; FFSM; Gaston; Java technology; MoFa; frequent fragments; gSpan; molecular databases; molecular dataset mining; symmetric multiprocessor systems; Association rules; Cybernetics; Data mining; Databases; Grid computing; Java; Lattices; Multiprocessing systems; Space technology; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
Electronic_ISBN :
1-4244-0100-3
Type :
conf
DOI :
10.1109/ICSMC.2006.384889
Filename :
4274023
Link To Document :
بازگشت