Title :
A decision graph explanation of protein secondary structure prediction
Author :
Dowe, David L. ; Oliver, Jonathan ; Dix, Trevor I. ; Allison, Lloyd ; Wallace, Christopher S.
Author_Institution :
Dept. of Comput. Sci., Monash Univ., Clayton, Vic., Australia
Abstract :
The machine-learning technique of decision graphs, a generalization of decision trees, is applied to the prediction of protein secondary structure to infer a theory for this problem. The resulting decision graph provides both a prediction method and an explanation for the problem. Many decision graphs are possible for the problem. A particular graph is just one theory or hypothesis of secondary structure formation. Minimum message length encoding is used to judge the quality of different theories. It is a general technique of inductive inference and is resistant to learning the noise in the training data. The method was applied to 75 sequences from nonhomologous proteins comprising 13 K amino acids. The predictive accuracy for three states (extended, helix, other) was in the range achieved by current methods.
Keywords :
biology computing; decision theory; graph theory; inference mechanisms; learning (artificial intelligence); macromolecular configurations; physics computing; proteins; amino acids; decision graph; inductive inference; machine-learning; minimum message length encoding; noise resistant technique; nonhomologous proteins; predictive accuracy; protein secondary structure prediction; training data; Accuracy; Amino acids; Biology computing; Computer science; Decision trees; Encoding; Machine learning; Prediction methods; Proteins; Sequences; Shape; Training data;
Conference_Titel :
System Sciences, 1993, Proceeding of the Twenty-Sixth Hawaii International Conference on
Print_ISBN :
0-8186-3230-5
DOI :
10.1109/HICSS.1993.270674