Skip to main content
Fig. 1 | Genome Biology

Fig. 1

From: Optimizing expression quantitative trait locus mapping workflows for single-cell studies

Fig. 1

Overview of normalization, aggregation and the single-cell eQTL mapping considered. a RNA pre-processing steps to obtain count matrices to perform eQTL mapping, including gene expression quantification, cell and gene-level quality control (QC), and cell type annotation. These steps are not optimized/tested in this work (shown in gray). b Different approaches tested to perform eQTL mapping using scRNA-seq profiles. Starting from one gene x cell count matrix obtained as in a, counts were aggregated per sample (i.e., donor, or donor-run combination), either by summing the data first at the sample level and then normalizing using methods designed for bulk RNA-seq (i.e., TMM [1]) as implemented in edgeR or by first normalizing the single-cell counts (using scran/scater [20, 21]) and then calculating the mean or the median at the sample level. c eQTL mapping (cis). We map eQTL independently for each gene-SNP pair considered by fitting a linear mixed model. In particular, we model gene expression as the outcome variable (y), the SNP effect as well as additional covariates as fixed effects, and include one (or more) random effect (RE) term to account for population structure and sample variation. We considered various methods to compute covariates and tested different numbers of covariates as well. d Multiple testing correction is performed in two steps. First, gene-level p values are adjusted using a permutation scheme (“Methods”) to control the FWER across SNPs. Second, the top SNP per gene is selected (minimum adjusted p value; “Methods”) and various methods are used to control the FDR and obtain globally corrected p values. Steps that we optimize here are highlighted in blue in panels b, c, and d

Back to article page