A Sequence-Segmented Method Applied to the Similarity Analysis of Long Protein Sequence

作者:Yao Yu hua*; Kong Fen; Dai Qi; He Ping an
来源:MATCH-Communications in Mathematical and in Computer Chemistry, 2013, 70(1): 431-450.

摘要

A 2-D graphical representation of protein sequences based on two classifications of amino acids is outlined. We transform the characteristic graphs into numerical characterization and used for similarity analysis of proteins. The method of dividing a long protein sequence into segments (SSM) is introduced, so protein graph is divided into k segments, geometrical center of the points for all protein curve segments is given as descriptors of proteins. It is not only useful for comparative study of proteins, but also for encoding amino acids in ways that the visualization of protein sequences facilitates the decoding of its information content. In addition, a simple example applied to the helicase proteins of 12 baculoviruses is taken to highlight the behavior of the new strategy.