Using Data-Display Networks for Exploratory Data Analysis in Phylogenetic Studies

Morrison David A<sup>*</sup>

doi:10.1093/molbev/msp309

摘要

Exploratory data analysis (EDA) is a frequently undervalued part of data analysis in biology. It involves evaluating the characteristics of the data "before" proceeding to the definitive analysis in relation to the scientific question at hand. For phylogenetic analyses, a useful tool for EDA is a data-display network. This type of network is designed to display any character (or tree) conflict in a data set, without prior assumptions about the causes of those conflicts. The conflicts might be caused by 1) methodological issues in data collection or analysis, 2) homoplasy, or 3) horizontal gene flow of some sort. Here, I explore 13 published data sets using splits networks, as examples of using data-display networks for EDA. In each case, I performed an original EDA on the data provided, to highlight the aspects of the resulting network that will be important for an interpretation of the phylogeny. In each case, there is at least one important point (possibly missed by the original authors) that might affect the phylogenetic analysis. I conclude that EDA should play a greater role in phylogenetic analyses than it has done.

出版日期2010-5

全文

访问全文

收藏分享被引(39) 浏览

更新时间：2024-05-03 08:09

Using Data-Display Networks for Exploratory Data Analysis in Phylogenetic Studies

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友