DocumentCode :
1330754
Title :
On Position-Specific Scoring Matrix for Protein Function Prediction
Author :
Jeong, Jong Cheol ; Lin, Xiaotong ; Chen, Xue-wen
Author_Institution :
Electr. Eng. & Comput. Sci. Dept., Univ. of Kansas, Lawrence, KS, USA
Volume :
8
Issue :
2
fYear :
2011
Firstpage :
308
Lastpage :
315
Abstract :
While genome sequencing projects have generated tremendous amounts of protein sequence data for a vast number of genomes, substantial portions of most genomes are still unannotated. Despite the success of experimental methods for identifying protein functions, they are often lab intensive and time consuming. Thus, it is only practical to use in silico methods for the genome-wide functional annotations. In this paper, we propose new features extracted from protein sequence only and machine learning-based methods for computational function prediction. These features are derived from a position-specific scoring matrix, which has shown great potential in other bininformatics problems. We evaluate these features using four different classifiers and yeast protein data. Our experimental results show that features derived from the position-specific scoring matrix are appropriate for automatic function annotation.
Keywords :
bioinformatics; learning (artificial intelligence); proteins; proteomics; automatic function annotation; bioinformatics; genome sequencing; machine learning; position-specific scoring matrix; protein function prediction; protein sequence; Amino acids; Bioinformatics; Feature extraction; Genomics; Probes; Protein sequence; Clustering; and association rules; classification; data mining; feature extraction or construction; mining methods and algorithms.; Algorithms; Artificial Intelligence; Computational Biology; Databases, Protein; Position-Specific Scoring Matrices; Proteins; Sequence Analysis, Protein;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2010.93
Filename :
5582078
Link To Document :
بازگشت