DocumentCode
3101264
Title
Mining protein data using parallel/distributed association rules
Author
Bahamish, Hesyam Awadh Abdallah ; Salam, Rosalina Abdul ; Abdullah, Rosni ; Osman, Mohd Azarn ; Rashid, Nur´Aini Abdul
Author_Institution
Sch. of Comput. Sci., Universiti Sains Malaysia, Penang, Malaysia
fYear
2004
fDate
19-23 April 2004
Firstpage
461
Lastpage
462
Abstract
Data Mining is used to extract the hidden information from large amount databases. Parallel/distributed computing is used which achieves scalability and improves the performance of compute intensive algorithms. A parallel version of ITL-Mine algorithm is proposed and implemented on distributed memory (shared nothing) architecture. The parallel ITL-Mine algorithm achieved the reduction of the execution time because of the distribution of the data over the processors where each processor worked on its data and communicates with other processors to complete its work. The proposed ITL-mine algorithm can be used in other data mining task such as sequential patterns, max-patterns and frequent closed patterns, classification and clustering.
Keywords
data mining; medical information systems; parallel architectures; proteins; very large databases; ITL-Mine algorithm; compute intensive algorithms; distributed memory architecture; hidden information extraction; large databases; parallel-distributed association rules; protein data mining; Association rules; Clustering algorithms; Computer architecture; Concurrent computing; Data mining; Databases; Distributed computing; Memory architecture; Proteins; Scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on
Print_ISBN
0-7803-8482-2
Type
conf
DOI
10.1109/ICTTA.2004.1307831
Filename
1307831
Link To Document