• DocumentCode
    3101264
  • Title

    Mining protein data using parallel/distributed association rules

  • Author

    Bahamish, Hesyam Awadh Abdallah ; Salam, Rosalina Abdul ; Abdullah, Rosni ; Osman, Mohd Azarn ; Rashid, Nur´Aini Abdul

  • Author_Institution
    Sch. of Comput. Sci., Universiti Sains Malaysia, Penang, Malaysia
  • fYear
    2004
  • fDate
    19-23 April 2004
  • Firstpage
    461
  • Lastpage
    462
  • Abstract
    Data Mining is used to extract the hidden information from large amount databases. Parallel/distributed computing is used which achieves scalability and improves the performance of compute intensive algorithms. A parallel version of ITL-Mine algorithm is proposed and implemented on distributed memory (shared nothing) architecture. The parallel ITL-Mine algorithm achieved the reduction of the execution time because of the distribution of the data over the processors where each processor worked on its data and communicates with other processors to complete its work. The proposed ITL-mine algorithm can be used in other data mining task such as sequential patterns, max-patterns and frequent closed patterns, classification and clustering.
  • Keywords
    data mining; medical information systems; parallel architectures; proteins; very large databases; ITL-Mine algorithm; compute intensive algorithms; distributed memory architecture; hidden information extraction; large databases; parallel-distributed association rules; protein data mining; Association rules; Clustering algorithms; Computer architecture; Concurrent computing; Data mining; Databases; Distributed computing; Memory architecture; Proteins; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on
  • Print_ISBN
    0-7803-8482-2
  • Type

    conf

  • DOI
    10.1109/ICTTA.2004.1307831
  • Filename
    1307831