Near-medians that avoid the corners; a combinatorial probability approach

作者:Larlee Caroline Anne; Zheng Chunfang; Sankoff David*
来源:BMC Genomics, 2014, 15(Suppl 6): S1.
DOI:10.1186/1471-2164-15-S6-S1

摘要

Background: The breakpoint median for a set of k %26gt;= 3 random genomes tends to approach (any) one of these genomes (%26quot;corners%26quot;) as genome length increases, although there are diminishing proportion of medians equidistant from all k (%26quot;medians in the middle%26quot;). Algorithms are likely to miss the latter, and this has consequences for the general case where input genomes share some or many gene adjacencies, where the tendency for the median to be closer to one input genome may be an artifact of the corner tendency. %26lt;br%26gt;Results: We present a simple sampling procedure for constructing a %26quot;near median%26quot; that represents a compromise among k random genomes and that has only a slightly greater breakpoint distance to all of them than the median does. We generalize to the realistic case where genomes share varying proportions of gene adjacencies. We present a supplementary sampling scheme that brings the constructed genome even closer to median status. %26lt;br%26gt;Conclusions: Our approach is of particular use in the phylogenetic context where medians are repeatedly calculated at ancestral nodes, and where the corner effect prevents different parts of the phylogeny from communicating with each other.

  • 出版日期2014-10-17