Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2025 Aug 4;15(1):28407.
doi: 10.1038/s41598-025-08808-3.

Optimized summary-statistic-based single-cell eQTL meta-analysis

Collaborators, Affiliations
Meta-Analysis

Optimized summary-statistic-based single-cell eQTL meta-analysis

Maryna Korshevniuk et al. Sci Rep. .

Abstract

The identification of expression quantitative trait loci (eQTLs) holds great potential to improve the interpretation of disease-associated genetic variation. As many such disease-associated variants act in a context-, tissue- or even cell-type-specific manner, single-cell RNA-sequencing (scRNA-seq) data is uniquely suitable for identifying the specific cell type or context in which these genetic variants act. However, due to the limited sample sizes in single-cell studies, discovery of cell-type-specific eQTLs is now limited. To improve power to detect such eQTLs, large-scale joint analyses are needed. These are however, complicated by privacy constraints due to sharing of genotype data and the measurement and technical variety across different scRNA-seq datasets as a result of differences in mRNA capture efficiency, experimental protocols, and sequencing strategies. A solution to these issues is a federated weighted meta-analysis (WMA) approach in which summary statistics are integrated using dataset-specific weights. Here, we compare different strategies and provide best practice recommendations for eQTL WMA across scRNA-seq datasets.

Keywords: Weighted meta-analysis; eQTL; scRNA-seq.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Workflow and weighting strategies for meta-analysis of single-cell eQTL data. (a) The general workflow for weighted meta-analysis. (b) The cohort-, gene-, SNP-, and eQTL-specific weights used in the meta-analysis. (c, d) Description of the (c) peripheral blood mononuclear cell (PBMC) and (d) induced pluripotent stem cell (iPSC) datasets used in the analysis. The PBMC data consists of five single-cell datasets (n = 187 donors: 141 with 10X V2 and 46 with 10X V3 chemistry). The iPSC data consists of two single-cell datasets (n = 87 donors: 87 Smart-Seq2 and 25 donor-matched 10X V2). To analyze independent sample sets and test for the effects of combining smaller datasets, we split the 87 Smart-Seq2 samples into Set A (62 non-10X-matched samples) and Set C (25 10X-matched samples). Set A was further split into a smaller set of 25 non-10X-matched samples (Set D).
Fig. 2
Fig. 2
Results of the improved sc-eQTL weighted meta-analysis in the PBMC pseudobulk datasets: (ad) Comparison of the fraction of eGenes detected and accuracy (F1* score) in WMA over all five datasets (a), 10X V3 chemistry (b), 10X V2 and V3 chemistries (c), and 10X V2 chemistry combinations (d). e Three pairwise 10X V2 chemistry dataset combinations. The expansion at right enlarges the top-performing weights in these comparisons. (f, g) Comparison of the best-performing weights among 10 pairwise dataset combinations in terms of the (f) increase in the number of eGenes and (g) the weighted mean of the eGenes change in comparison to sample-size weighting.
Fig. 3
Fig. 3
Meta-analysis of sample-size-like weights in monocyte datasets. (ad) Combination of the number of eGenes detected and accuracy (F1* score) for (a) all five datasets and (b) sample-size-wise the largest pairwise comparisons of 10X V3 chemistry, (c) 10X V2 and V3 chemistries, and (d) 10X V2 chemistry combinations. (e) Three pairwise 10X V2 chemistry dataset combinations. The expansion at right enlarges the top-performing weights. (f, g) Comparison of the (f) main weights among 10 pairwise dataset combinations in terms of the increase in the number of eGenes and (g) the weighted mean of eGenes change.
Fig. 4
Fig. 4
Weighted meta-analysis of sample-size-like characteristics in iPSC samples. ad The number of eGenes detected and accuracy (F1* score) for all four iPSC datasets. e, f Results of the grid search in combinations of Smart-seq2 (Set B or C) and 10X (Set D) samples. Vertical lines indicate the ratio of weight of Set C versus weight of Set D (e) or of Set C versus Set D (f) for one of the weights of the WMA.
Fig. 5
Fig. 5
Weighted meta-analysis with secondary weights on a monocyte and b iPSC datasets employing secondary weights derived from gene expression features and minor allele frequency (MAF) in combination with overall best-performing sample-size-derived weights. a For monocyte samples, we used the average number of cells per donor as a main weight. b For iPSC samples, we used sample size as a main weight. Numbers in the heatmap indicate the change in the number of eGenes detected. Green and red indicate an increase or decrease in the number of eGenes compared to the main weight, respectively. Color saturation indicates the fold-change of the number of eGenes detected.

References

    1. Ashton, J. M. et al. Comparative analysis of single-Cell RNA sequencing platforms and methods. J. Biomol. Tech. JBT32, 3fc1f5fe.3eccea01 (2021). - PMC - PubMed
    1. Gezelius, H. et al. Comparison of high-throughput single-cell RNA-seq methods for ex vivo drug screening. NAR Genom. Bioinforma.6, lqae001 (2024). - PMC - PubMed
    1. Mereu, E. et al. Benchmarking single-cell RNA-sequencing protocols for cell atlas projects. Nat. Biotechnol.38, 747–755 (2020). - PubMed
    1. Svensson, V., Vento-Tormo, R. & Teichmann, S. A. Exponential scaling of single-cell RNA-seq in the past decade. Nat. Protoc.13, 599–604 (2018). - PubMed
    1. van der Wijst, M. et al. The single-cell eQTLGen consortium. Elife9, e52155 (2020). - PMC - PubMed

Publication types

LinkOut - more resources