DocumentCode
3521894
Title
Prediction of Protein Quaternary Structure by a Novel Manifold Learning Algorithm
Author
Wang, Tong ; Wang, Jian ; Yao, Lixiu
Author_Institution
Inst. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
fYear
2011
fDate
28-29 May 2011
Firstpage
1
Lastpage
3
Abstract
With the explosion of protein sequences generated in the Post-Genomic Age, it is urgent to develop an automated method to predict protein quaternary structure. To explore this problem, we adopted an approach based on a sequence encoding descriptor by fusing PseAA (Pseudo Amino Acid) and DC (Dipeptide Composition) representing a protein sample. Here, a completely different approach, manifold learning algorithm MVP (Maximum variance projection) is introduced to extract the key features from the high-dimensional feature space. The dimension-reduced descriptor vector thus obtained is a compact representation of the original high dimensional vector. Our jackknife test results indicate that it is very promising to use the dimensionality reduction approaches to cope with complicated problems in biological systems, such as predicting the quaternary structure of proteins.
Keywords
bioinformatics; biological techniques; data reduction; feature extraction; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; MVP; PseAA-DC fusion; automated protein quaternary structure prediction; dimension reduced descriptor vector; dimensionality reduction approach; dipeptide composition; feature extraction; high dimensional feature space; manifold learning algorithm; maximum variance projection; protein sequences; pseudo-amino acid; sequence encoding descriptor; Amino acids; Biological systems; Classification algorithms; Feature extraction; Prediction algorithms; Proteins; Support vector machine classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems and Applications (ISA), 2011 3rd International Workshop on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-9855-0
Electronic_ISBN
978-1-4244-9857-4
Type
conf
DOI
10.1109/ISA.2011.5873419
Filename
5873419
Link To Document