Mining protein data using parallel/distributed association rules

Author

Bahamish, Hesyam Awadh Abdallah ; Salam, Rosalina Abdul ; Abdullah, Rosni ; Osman, Mohd Azarn ; Rashid, Nur´Aini Abdul

Author_Institution

Sch. of Comput. Sci., Universiti Sains Malaysia, Penang, Malaysia

fYear

2004

fDate

19-23 April 2004

Firstpage

461

Lastpage

462

Abstract

Data Mining is used to extract the hidden information from large amount databases. Parallel/distributed computing is used which achieves scalability and improves the performance of compute intensive algorithms. A parallel version of ITL-Mine algorithm is proposed and implemented on distributed memory (shared nothing) architecture. The parallel ITL-Mine algorithm achieved the reduction of the execution time because of the distribution of the data over the processors where each processor worked on its data and communicates with other processors to complete its work. The proposed ITL-mine algorithm can be used in other data mining task such as sequential patterns, max-patterns and frequent closed patterns, classification and clustering.

Keywords

data mining; medical information systems; parallel architectures; proteins; very large databases; ITL-Mine algorithm; compute intensive algorithms; distributed memory architecture; hidden information extraction; large databases; parallel-distributed association rules; protein data mining; Association rules; Clustering algorithms; Computer architecture; Concurrent computing; Data mining; Databases; Distributed computing; Memory architecture; Proteins; Scalability;

fLanguage

English

Publisher

ieee

Conference_Titel

Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on

Print_ISBN

0-7803-8482-2

Type

conf

DOI

10.1109/ICTTA.2004.1307831

Filename

1307831