A new method to analyze the similarity of the DNA sequences

作者:Guo Ying*; Wang Tian Ming
来源:Journal of Molecular Structure (Theochem), 2008, 853(1-3): 62-67.
DOI:10.1016/j.theochem.2007.12.003

摘要

In this paper, we propose a new method to analyze the similarity/dissimilarity of DNA sequences based on the graphical representation proposed by Randic et al. (2003) [M. Randic, M. Vracko, L. Nella, P. Dejan, Chem. Phys. Lett. 368 (2003) 1]. Instead of calculating the leading eigenvalues of the matrix for graphical representation, we smooth the zigzag curve and calculate its curvature as the descriptor to numerical characterize DNA sequences. The proposed method is tested on two real data sets: the coding sequences of beta-globin gene and all of their exons. The reasonable results verify the validity of our method.