DocumentCode
729406
Title
Similarity analysis based on sparse representation for protein sequence comparison
Author
Lina Yang ; Yuan Yan Tang ; Yulong Wang ; Huiwu Luo ; Jianjia Pan ; Haoliang Yuan ; Xianwei Zheng ; Chunli Li ; Ting Shu
Author_Institution
Dept. of Comput. & Inf. Sci., Univ. of Macau, Macau, China
fYear
2015
fDate
24-26 June 2015
Firstpage
382
Lastpage
387
Abstract
This paper propose a least square-based sparse representation algorithm to analyze similarity comparison of protein sequences in the area of bioinformatics and molecular biology, which helps the prediction and classification of protein structure and function. The protein sequences are represented into the 1-dimensional feature vectors by their biochemical quantities. Then using the least square method to form the feature vector. Through the similarity calculation, the distance matrix can be obtained, by which, the phylogenic tree can be constructed.We apply this approach by analyzing the ND5 (NADH dehydrogenase subunit 5) protein cluster dataset. The experimental results show that the proposed model is more accurate than the Su´s model,and it is closer with some known biological facts.
Keywords
bioinformatics; data structures; least squares approximations; pattern classification; proteins; vectors; bioinformatics; feature vector; least square-based sparse representation algorithm; molecular biology; protein sequence; protein structure classification; protein structure prediction; similarity analysis; Amino acids; Encoding; Matching pursuit algorithms; Protein sequence; Training; Feature extraction; l1 -regularized least squares; protein sequence analysis; sparse representation;
fLanguage
English
Publisher
ieee
Conference_Titel
Cybernetics (CYBCONF), 2015 IEEE 2nd International Conference on
Conference_Location
Gdynia
Print_ISBN
978-1-4799-8320-9
Type
conf
DOI
10.1109/CYBConf.2015.7175964
Filename
7175964
Link To Document