A statistical framework for analyzing deep mutational scanning data

Authors: Rubin Alan F; Gelman Hannah; Lucas Nathan; Bajjalieh Sandra M; Papenfuss Anthony T; Speed Terence P; Fowler Douglas M*
Source: Genome Biology, 2017, 18(1): 150.
DOI: 10.1186/s13059-017-1272-5

Abstract

Deep mutational scanning is a widely used method for multiplex measurement of functional consequences of protein variants. We developed a new deep mutational scanning statistical model that generates error estimates for each measurement, capturing both sampling error and consistency between replicates. We apply our model to one novel and five published datasets comprising 243,732 variants and demonstrate its superiority in removing noisy variants and conducting hypothesis testing. Simulations show our model applies to scans based on cell growth or binding and handles common experimental errors. We implemented our model in Enrich2, software that can empower researchers analyzing deep mutational scanning data.