摘要

The DNA sequencing problem is aimed at reconstructing an unknown fragment of DNA (deoxyribonucleic acid) based upon a set of oligonucleotides that makes up the fragment's spectrum. Such a spectrum is the result of a hybridization experiment, in which oligonucleotides hybridize with the unknown DNA sequence. The introduction of errors (positive as well as negative) in the biological experiment phase gives rise to a challenging combinatorial optimization problem. The DNA sequencing problem can be modeled as a variation of the classical traveling salesman problem and, due to its computational complexity, it is a candidate for the design and implementation of metaheuristic algorithms. We present a hybrid algorithm, which may be seen as an approach within the recently introduced area of matheuristics, i.e., an approach in which mathematical programming techniques and metaheuristic schemes are effectively intertwined. The algorithm is tested on 400 benchmark instances from the literature and compares favorably with the best known algorithm. In addition, an explanation concerning the relation between error distribution and algorithmic performance is provided, illustrating that the way in which negative errors are distributed within the spectrum has a bearing on the overall algorithmic performance.

  • 出版日期2014-1-30