DocumentCode
2340720
Title
Learning from genome sequences utilizing computational intelligence
Author
Yang, Jack Y. ; Yang, Mary Qu ; Ersoy, Okan
Author_Institution
Harvard Med. Sch., Harvard Univ., Boston, MA
fYear
0
fDate
0-0 0
Abstract
Advances in genome sequencing technology have led to an exploration in the amount of sequence data available, learning from proteins coded for by genomes is a difficult task. Bioinformatics is thus a burgeoning field that holds great promise for deepening our understanding of biochemical pathways, for understanding the genetic differences between species and how they arose, and for understanding the genetic basis of various disease processes. We developed a method for classification and knowledge discovery in membrane and intrinsic unstructured/disordered proteins (IUP). We analyzed the amino acid compositions and biophysical properties of proteins. Our joint transmembrane and IUP predictor utilized biophysical characterizations, feature generation, feature selection and computational intelligence as well as ensemble methods to improve the accuracies and performances
Keywords
biology computing; biomembranes; data mining; feature extraction; genetics; learning (artificial intelligence); molecular biophysics; proteins; amino acid composition; bioinformatics; biophysical characterization; computational intelligence; data mining; feature generation; feature selection; genetic difference; genome sequence; intrinsic disordered protein; intrinsic unstructured protein; knowledge discovery; learning; membrane protein; transmembrane; Amino acids; Bioinformatics; Biomembranes; Character generation; Computational intelligence; Diseases; Genetics; Genomics; Protein engineering; Sequences; Bioinformatics; classification and knowledge discovery; data mining and computational intelligence; intrinsic unstructured proteins; membrane proteins;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence Methods and Applications, 2005 ICSC Congress on
Conference_Location
Istanbul
Print_ISBN
1-4244-0020-1
Type
conf
DOI
10.1109/CIMA.2005.1662343
Filename
1662343
Link To Document