Title :
Sequential and parallel implementation of a constraint-based algorithm for searching protein structures
Author :
Hunold, Sascha ; Rauber, Thomas ; Wille, Georg
Author_Institution :
Dept. of Math. & Phys., Bayreuth Univ., Bayreuth
Abstract :
Data mining in biological structure libraries can be a powerful tool for better understanding biochemical processes. This article introduces the LISA algorithm, which enables researchers to search for substructures in PDB files describing the 3D structure of protein molecules. The use of constraints such as atomic distances, torsion angles, or the distance between residues within the linear amino acid sequence allows for great flexibility in defining and searching for specific structures that could not be found with other tools. Data mining in biological databases, e.g. scanning the entire PDB database for structures that match user-defined criteria, is a highly computation-intensive task. We therefore present a parallel implementation of LISA and show that the algorithm achieves good parallel efficiency on homogeneous clusters.
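The three kinds of constraint named in the abstract (atomic distance, torsion angle, and sequence separation) can be illustrated with a minimal Python sketch. This is not the LISA implementation, whose details are not given here; the function names (`distance`, `torsion_deg`, `find_pairs`), the `(residue_number, (x, y, z))` atom representation, and the brute-force pair scan are all assumptions made for illustration only.

```python
import math

# Small 3D vector helpers (coordinates are (x, y, z) tuples, e.g. in angstroms).
def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def distance(p, q):
    """Euclidean distance between two atom positions."""
    d = sub(p, q)
    return math.sqrt(dot(d, d))

def torsion_deg(p0, p1, p2, p3):
    """Torsion (dihedral) angle in degrees defined by four atom positions."""
    b0, b1, b2 = sub(p1, p0), sub(p2, p1), sub(p3, p2)
    n2 = cross(b1, b2)
    x = dot(cross(b0, b1), n2)
    y = math.sqrt(dot(b1, b1)) * dot(b0, n2)
    return math.degrees(math.atan2(y, x))

def find_pairs(atoms, d_min, d_max, min_seq_sep):
    """Hypothetical brute-force search: return residue-number pairs whose atoms
    lie within [d_min, d_max] of each other and are at least min_seq_sep
    positions apart in the linear sequence."""
    hits = []
    for i, (res_i, pos_i) in enumerate(atoms):
        for res_j, pos_j in atoms[i + 1:]:
            if (abs(res_j - res_i) >= min_seq_sep
                    and d_min <= distance(pos_i, pos_j) <= d_max):
                hits.append((res_i, res_j))
    return hits

# Toy data, not taken from any PDB entry.
atoms = [(1, (0.0, 0.0, 0.0)), (2, (1.5, 0.0, 0.0)),
         (10, (3.0, 0.0, 0.0)), (20, (50.0, 0.0, 0.0))]
pairs = find_pairs(atoms, d_min=2.0, d_max=4.0, min_seq_sep=5)
```

In this toy input only residues 1 and 10 are both close in space and far apart in sequence, so they form the single hit. Because each structure file can be checked independently of the others, a scan of this kind over many files is naturally divisible across cluster nodes, which is the setting the parallel implementation addresses.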
Keywords :
biochemistry; biology computing; data mining; molecular biophysics; parallel algorithms; pattern clustering; pattern matching; proteins; scientific information systems; search problems; LISA algorithm; PDB file searching; biochemical process; biological database; biological structure library; constraint-based algorithm; linear amino acid sequence; parallel implementation; protein molecule 3D structure searching; sequential implementation; Amino acids; Bioinformatics; Databases; Information analysis; Libraries; Mathematics; Physics; Sequences
Conference_Title :
Cluster Computing, 2007 IEEE International Conference on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4244-1387-4
Electronic_ISBN :
1552-5244
DOI :
10.1109/CLUSTR.2007.4629254