DocumentCode :
2010325
Title :
Aggregating Homologous Protein Families in Evolutionary Reconstructions of Herpesviruses
Author :
Mirkin, Boris ; Camargo, Renata ; Fenner, Trevor ; Loizou, George ; Kellam, Paul
Author_Institution :
Sch. of Comput. Sci. & Inf. Syst., London Univ.
fYear :
2006
fDate :
28-29 Sept. 2006
Firstpage :
1
Lastpage :
8
Abstract :
Protein families can be used to reconstruct evolutionary histories of organisms. The accuracy of protein assignment to such families is critical for the success of such studies. Here we investigate the automatic aggregation of motif-defined homologous protein families for further reconstruction of their evolutionary histories. We propose a method that utilises only parameters that can be adjusted by using the data. The building blocks of the method include: (a) a majority rule for combining protein homologous neighbourhood lists into that for a family, and (b) a robust clustering procedure whose only parameter, the similarity shift, can be estimated from information on proteins with known function. The method is applied to a herpesvirus protein dataset leading to insights into the composition of ancestors of herpesvirus superfamilies. Comparison of the computational reconstructions with more comprehensive analyses also show how alignment-based between-protein similarity scoring can be improved by using data on gene arrangements
Keywords :
biology computing; data handling; evolutionary computation; microorganisms; proteins; clustering procedure; evolutionary reconstruction; gene arrangement; herpesvirus protein dataset; herpesvirus superfamily; homologous protein family; protein assignment; protein information; protein similarity scoring; similarity shift; Animals; Bioinformatics; Genomics; History; Organisms; Pathogens; Phylogeny; Proteins; Robustness; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Bioinformatics and Computational Biology, 2006. CIBCB '06. 2006 IEEE Symposium on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0623-4
Electronic_ISBN :
1-4244-0624-2
Type :
conf
DOI :
10.1109/CIBCB.2006.330944
Filename :
4133180
Link To Document :
بازگشت