An Efficient Binomial Model-Based Measure for Sequence Comparison and its Application

Liu, Xiaoqing; Dai, Qi<sup>*</sup>; Li, Lihua; He, Zerong

doi:10.1080/07391102.2011.10508611

摘要

Sequence comparison is one of the major tasks in bioinformatics, which could serve as evidence of structural and functional conservation, as well as of evolutionary relations. There are several similarity/dissimilarity measures for sequence comparison, but challenges remains. This paper presented a binomial model-based measure to analyze biological sequences. With help of a random indicator, the occurrence of a word at any position of sequence can be regarded as a random Bernoulli variable, and the distribution of a sum of the word occurrence is well known to be a binomial one. By using a recursive formula, we computed the binomial probability of the word count and proposed a binomial model-based measure based on the relative entropy. The proposed measure was tested by extensive experiments including classification of HEV genotypes and phylogenetic analysis, and further compared with alignment -based and alignment-free measures. The results demonstrate that the proposed measure based on binomial model is more efficient.

出版日期2011-4
单位杭州电子科技大学

全文

访问全文

收藏分享被引(1) 浏览

更新时间：2021-07-10 21:30

An Efficient Binomial Model-Based Measure for Sequence Comparison and its Application

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友