Accumulation Tests for FDR Control in Ordered Hypothesis Testing

Li Ang<sup>*</sup>; Barber Rina Foygel

doi:10.1080/01621459.2016.1180989

摘要

Multiple testing problems arising in modern scientific applications can involve simultaneously testing thousands or even millions of hypotheses, with relatively few true signals. In this article, we consider the multiple testing problem where prior information is available (for instance, from an earlier study under different experimental conditions), that can allow us to test the hypotheses as a ranked list to increase the number of discoveries. Given an ordered list of n hypotheses, the aim is to select a data-dependent cutoff k and declare the first k hypotheses to be statistically significant while bounding the false discovery rate (FDR). Generalizing several existing methods, we develop a family of accumulation tests to choose a cutoff k that adapts to the amount of signal at the top of the ranked list. We introduce a new method in this family, the HingeExp method, which offers higher power to detect true signals compared to existing techniques. Our theoretical results prove that these methods control a modified FDR on finite samples, and characterize the power of the methods in the family. We apply the tests to simulated data, including a high-dimensional model selection problem for linear regression. We also compare accumulation tests to existing methods for multiple testing on a real data problem of identifying differential gene expression over a dosage gradient. Supplementary materials for this article are available online.

出版日期2017-6

全文

访问全文

收藏分享被引(9) 浏览

更新时间：2021-01-20 13:40

Accumulation Tests for FDR Control in Ordered Hypothesis Testing

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友