A novel model for protein sequence similarity analysis based on spectral radius

Wu, Chuanyan; Gao, Rui<sup>*</sup>; De Marinis, Yang; Zhang, Yusen

doi:10.1016/j.jtbi.2018.03.001

摘要

Advances in sequencing technologies led to rapid increase in the number and diversity of biological sequences, which facilitated development in the sequence research. In this paper, we present a new method for analyzing protein sequence similarity. We calculated the spectral radii of 20 amino acids (AAs) and put forward a novel 2-D graphical representation of protein sequences. To characterize protein sequences numerically, three groups of features were extracted and related to statistical, dynamics measurements and fluctuation complexity of the sequences. With the obtained feature vector, two models utilizing Gaussian Kernel similarity and Cosine similarity were built to measure the similarity between sequences. We applied our method to analyze the similarities/dissimilarities of four data sets. Both proposed models received consistent results with improvements when compared to that obtained by the ClustalW analysis. The novel approach we present in this study may therefore benefit protein research in medical and scientific fields.

出版日期2018-6-7
单位山东大学

全文

访问全文

收藏分享被引(2) 浏览

更新时间：2021-07-07 23:15

A novel model for protein sequence similarity analysis based on spectral radius

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友