Title :
Recognizing patterns in high-dimensional data: automated histogram filtering for protein structure elucidation
Author :
Imada, Janine ; Chapman, Paul ; Rothstein, Stuart M.
Author_Institution :
Brock Univ., St. Catharines, Ont., Canada
Abstract :
Advanced automated clustering algorithms are essential to meet the challenge of understanding high-dimensional data that arise in protein structure elucidation. In this paper, we first describe a powerful new approach to cluster analysis, automated histogram filtering (AHF). We follow this by a review of applications of AHF-clustering of Monte Carlo and molecular dynamics simulation data for off-lattice and all-atom model protein systems. Next, we present the results of a time-complexity analysis performed on a computer cluster suggesting how the computational effort scales with the size of the protein and the number of processors. And finally, we conclude with a summary of our findings.
Keywords :
Monte Carlo methods; biology computing; computational complexity; pattern clustering; principal component analysis; proteins; Monte Carlo method; automated histogram filtering; cluster analysis; computer cluster; high-dimensional data; molecular dynamics; pattern recognition; protein structure elucidation; time-complexity analysis; Application software; Clustering algorithms; Computational modeling; Filtering; Histograms; Monte Carlo methods; Pattern recognition; Performance analysis; Power system modeling; Proteins;
Conference_Titel :
High Performance Computing Systems and Applications, 2005. HPCS 2005. 19th International Symposium on
Print_ISBN :
0-7695-2343-9
DOI :
10.1109/HPCS.2005.44