Skip to main content

Correction to: A statistical framework for analyzing deep mutational scanning data

The Original Article was published on 07 August 2017


After publication of our article [1] it was brought to our attention that a line of code was missing from our program to combine the within-replicate variance and between-replicate variance. This led to an overestimation of the standard errors calculated using the Enrich2 random-effects model.

This programming error has been corrected in v1.2.0 of the software, available from [2].

We have reanalyzed the data in the paper using v1.2.0. The results are very similar and do not significantly affect any of the manuscript’s conclusions. Notably, none of the numeric values quoted in the main text change.

Updated versions of Figs. 4, 5, 6 and 7 are provided below. Updated versions of Additional file 2 Figures S1, S2 and S4 and Additional files 3, 4 and 5 are available from the links below.

Fig. 4
figure 1

A random-effects model for scoring replicate selections. Variant scores for 20 randomly selected variants from the BRCA1 E3 ubiqutin ligase dataset are shown. The replicate scores (green) were determined for each variant using Enrich2 weighted regression. Combined variant score estimates were determined using a fixed-effect model (orange) or the Enrich2 random-effects model (blue). In all cases, error bars show +2 or −2 standard errors

Fig. 5
figure 2

Standard error-based filtering improves replicate correlation. Variant scores from two replicates of the C2 domain dataset are shown. Each panel plots the top quartile of variants selected by standard error from the random-effects model (leftmost column, blue points), standard error from the fixed-effect model (middle-left column, green points), input library count (middle-right column, orange points), or total count in all libraries (rightmost column, purple points). Scores and standard errors are calculated using only the input and final round of selection (top row) or using all three rounds (bottom row). The dashed line is the best linear fit and the Pearson correlation coefficient is shown

Fig. 6
figure 3

Standard errors enable hypothesis testing. a Enrich2 variant scores are plotted against single-variant growth assay scores for the 22 individually validated variants of the neuraminidase dataset. Four (18%) of these variants have Enrich2 standard errors larger than the median standard error. The dotted line shows the best linear fit for all variants and the dashed line shows the best linear fit for variants with standard errors less than the median. b Enrich2 variant scores are plotted for selections performed in the presence or absence of the small molecule inhibitor oseltamivir. Colored points indicate variants that significantly outperformed wild-type in the drug’s presence. Red points also scored significantly higher than wild-type in the drug’s absence. Triangles indicate the five “drug-adaptive” mutations identified originally [3]

Fig. 7
figure 4

Variant scoring for growth and binding experiments using simulated data. a Enrich2 variant effect scores derived from simulated data are plotted against expected Enrich2 scores based on true variant effects in the simulation. Enrich2 accurately scores variants in both simulated binding assays (left) and growth assays (right). Shading indicates point density from low (blue) to high (white). b Noisy variants were generated by randomizing their true effect in one replicate selection (green line). Noisy variants have higher overall standard errors than other variants (dashed gray line) in both binding and growth assay simulations. c The percentage of variants removed at each standard error percentile cutoff (5% intervals) is plotted. Standard error filtering preferentially removes noisy variants (green points)

We would like to thank Jörn Schmiedel for alerting us to this issue, helping us reproduce it and testing the fix.


  1. Rubin AF, Gelman H, Lucas N, Bajjalieh SM, Papenfuss AT, Speed TP, et al. A statistical framework for analyzing deep mutational scanning data. Genome Biol. 2017;18(1):150.

    Article  PubMed  PubMed Central  Google Scholar 


  3. Jiang L, Liu P, Bank C, Renzette N, Prachanronarong K, Yilmaz LS, et al. A balance between inhibitor binding and substrate processing confers influenza drug resistance. J Mol Biol. 2016;428:538–53.

    Article  CAS  PubMed  Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Douglas M. Fowler.

Additional information

The original article can be found online at

Additional files

Additional file 2:

Supplementary figures. (PDF 423 kb)

Additional file 3:

Replicate correlation tables. (XLSX 14 kb)

Additional file 4:

Individually validated variants of the neuraminidase gene. (XLSX 12 kb)

Additional file 5:

Variants with higher scores than wild-type in the presence of oseltamivir. (XLSX 15 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rubin, A.F., Gelman, H., Lucas, N. et al. Correction to: A statistical framework for analyzing deep mutational scanning data. Genome Biol 19, 17 (2018).

Download citation

  • Published:

  • DOI: