An Overview of Population Genetic Data Simulation

作者:Yuan, Xiguo; Miller, David J.; Zhang, Junying; Herrington, David; Wang, Yue*
来源:Journal of Computational Biology, 2012, 19(1): 42-54.
DOI:10.1089/cmb.2010.0188

摘要

Simulation studies in population genetics play an important role in helping to better understand the impact of various evolutionary and demographic scenarios on sequence variation and sequence patterns, and they also permit investigators to better assess and design analytical methods in the study of disease-associated genetic factors. To facilitate these studies, it is imperative to develop simulators with the capability to accurately generate complex genomic data under various genetic models. Currently, a number of efficient simulation software packages for large-scale genomic data are available, and new simulation programs with more sophisticated capabilities and features continue to emerge. In this article, we review the three basic simulation frameworks-coalescent, forward, and resampling- and some of the existing simulators that fall under these frameworks, comparing them with respect to their evolutionary and demographic scenarios, their computational complexity, and their specific applications. Additionally, we address some limitations in current simulation algorithms and discuss future challenges in the development of more powerful simulation tools.