Title :
Analysis and visualization of category membership distribution in multivariate data
Author :
Pao, Y.-H. ; Duan, B.F. ; Zhao, Y.L. ; LeClair, S.R.
Author_Institution :
Case Western Reserve Univ., Cleveland, OH, USA
Abstract :
This paper reports on some advances in generic data processing procedures, with a focus on a specific materials discovery and design task. The task is to predict whether a new ternary materials system will be compound-forming or not, with the prediction based on knowledge of many known exemplars. The activities and results of three related efforts are described. In the first effort, a combination of clustering and mapping procedures attained an accuracy of more than 99% in predicting the category status of new ternary systems. The second effort addressed the question of how to identify redundant or superfluous features; a procedure for identifying the extent of functional dependency among features was developed. The third effort addressed the question of how to obtain reduced-dimension representations of multivariate data, albeit at the cost of losing some information. Visualizations of low-dimensional representations can be helpful in building up holistic views of data space, for use in the exploration and discovery of new materials.
Keywords :
chemical engineering computing; data visualisation; materials science; category membership distribution; cluster analysis; compound forming; data visualization; materials design; materials discovery; multivariate data; regional analysis; ternary systems; Costs; Data analysis; Data visualization; Large Hadron Collider
Conference_Title :
Intelligent Processing and Manufacturing of Materials, 1999. IPMM '99. Proceedings of the Second International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
0-7803-5489-3
DOI :
10.1109/IPMM.1999.791565