DocumentCode :
1330761
Title :
Predicting MHC-II Binding Affinity Using Multiple Instance Regression
Author :
EL-Manzalawy, Yasser ; Dobbs, Drena ; Honavar, Vasant
Author_Institution :
Dept. of Syst. & Comput. Eng., Al-Azhar Univ., Cairo, Egypt
Volume :
8
Issue :
4
fYear :
2011
Firstpage :
1067
Lastpage :
1079
Abstract :
Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark data sets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir.
Keywords :
bioinformatics; molecular biophysics; organic compounds; regression analysis; MHC-II binding affinity; amino acid; antigen peptide; immune response; major histocompatibility complex class II; multiple instance regression; pathogenesis; vaccine; Amino acids; Peptides; Prediction algorithms; Prediction methods; Proteins; Shape; Training; MHC-II peptide prediction; multiple instance learning; multiple instance regression.; Amino Acid Sequence; Animals; Area Under Curve; Computational Biology; DNA-Binding Proteins; Genes, MHC Class II; Humans; Mice; Models, Statistical; Molecular Sequence Data; Peptides; Protein Binding; Regression Analysis; Reproducibility of Results; Transcription Factors;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2010.94
Filename :
5582079
Link To Document :
بازگشت