Title of article :
Fuzzy clustering of 627 alcohols, guided by a strategy for cluster analysis of chemical compounds for combinatorial chemistry
Author/Authors :
Linusson، نويسنده , , Anna and Wold، نويسنده , , Svante and Nordén، نويسنده , , Bo، نويسنده ,
Issue Information :
دوفصلنامه با شماره پیاپی سال 1998
Abstract :
A strategy for cluster analysis of chemical compounds for combinatorial chemistry is presented in this paper and applied to a set of 627 alcohols. The alcohols are characterised by 50 semi-empirical descriptors and the resulting 627×50 table was compressed by PCA. The method used to investigate the groupings was fuzzy clustering, using the fuzzy c-means algorithm. This technique allows a compound to belong to more than one group. Different values of the fuzziness coefficients were used and two different distances were incorporated in the algorithm, the traditional Euclidean distance and the Mahalanobis distance. The latter takes correlations within a group into account and can hence deal with elongated clusters. The resulted membership matrices were validated by PLS regression. The models created were used to verify statistical and chemical relevance of the formed clusters. The results showed that the Mahalanobis distance and a fuzziness coefficient of 1.2 should be used for an optimal clustering. The coefficients from the PLS models were further used for chemical interpretation of the groups. The four groups were chemically interpretable and consistent. The first group contained large flexible molecules, the second contained more polar compounds, the third contained molecules with two or more aromatic rings fused together, and the forth contained small and relatively reactive molecules. Molecules that did not fit into any of the groups, i.e., singletons, were flagged as outliers in the PLS models.
Keywords :
Cluster analysis , 627 alcohols , Fuzzy clustering , Combinatorial chemistry , PLS
Journal title :
Chemometrics and Intelligent Laboratory Systems
Journal title :
Chemometrics and Intelligent Laboratory Systems