DocumentCode
2517298
Title
Protein Sequences Analysis Based on Smoothed PST
Author
Liu, Yong ; Yan, Tuanjun ; Zhang, Rui
Author_Institution
Inst. of Intell. Vision & Image, Inf. China Three Gorges Univ., Yichang, China
fYear
2009
fDate
11-13 June 2009
Firstpage
1
Lastpage
4
Abstract
Sequence alignment algorithms, which are hardly to be efficient, are frequently used in protein sequences analysis. In order to improve the analyzing efficiency, an improved PST(Probabilistic Suffix Trees) model is proposed in this paper. Firstly, by analyzing the similarity between protein sequences analysis and sequences data mining, the idea of using PST model to analyze protein sequences is presented; And then the standard PST model is improved by smoothing operation according to the features of protein sequences analysis; Next, taking the smoothed PST as features of data set, the similarities degree between protein sequences are calculated by using the similarities of the normalized sequences; At last, the effectiveness and high efficiency of the algorithm are verified by some protein sequences analysis examples.
Keywords
biology computing; data mining; molecular biophysics; probability; proteins; smoothing methods; trees (mathematics); data set feature; protein sequences analysis; sequence alignment algorithm; sequence data mining; smoothed probabilistic suffix tree model; Algorithm design and analysis; Data analysis; Data mining; Image analysis; Image sequence analysis; Information analysis; Predictive models; Protein sequence; Smoothing methods; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-2901-1
Electronic_ISBN
978-1-4244-2902-8
Type
conf
DOI
10.1109/ICBBE.2009.5163257
Filename
5163257
Link To Document