Systema: a framework for evaluating genetic perturbation response prediction beyond systematic variation
- PMID: 40854979
- DOI: 10.1038/s41587-025-02777-8
Systema: a framework for evaluating genetic perturbation response prediction beyond systematic variation
Abstract
Predicting transcriptional responses to genetic perturbations is challenging in functional genomics. While recent methods aim to infer effects of untested perturbations, their true predictive power remains unclear. Here, we show that current methods struggle to generalize beyond systematic variation, the consistent transcriptional differences between perturbed and control cells arising from selection biases or confounders. We quantify this variation in ten datasets, spanning three technologies and five cell lines, and show that common metrics are susceptible to these biases, leading to overestimated performance. To address this, we introduce Systema, an evaluation framework that emphasizes perturbation-specific effects and identifies predictions that correctly reconstruct the perturbation landscape. Using this framework, we uncover insights into the predictive capabilities of existing methods and show that predicting responses to unseen perturbations is substantially harder than standard metrics suggest. Our work highlights the importance of heterogeneous gene panels and disentangles predictive performance from systematic effects, enabling biologically meaningful developments in perturbation response modeling.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: M.W. is a part-time employee of Genentech. Z.P. is a full-time employee of Genentech. S.A.T. is a scientific advisory board member of ForeSite Labs, OMass Therapeutics and QIAGEN; a co-founder and equity holder of TransitionBio and EnsoCell Therapeutics; a non-executive director of 10x Genomics; and a part-time employee of GlaxoSmithKline. The remaining authors declare no competing interests.
References
-
- Roohani, Y., Huang, K. & Leskovec, J. Predicting transcriptional outcomes of novel multigene perturbations with GEARS. Nat. Biotechnol. 42, 927–35 (2024). - PubMed
-
- Cui, H. et al. scGPT: toward building a foundation model for single-cell multi-omics using generative AI. Nat. Methods 21, 1470–80 (2024). - PubMed
-
- Hao, M. et al. Large-scale foundation model on single-cell transcriptomics. Nat. Methods 21, 1481–91 (2024). - PubMed
-
- Ahlmann-Eltze, C., Huber, W. & Anders, S. Deep learning-based predictions of gene perturbation effects do not yet outperform simple linear methods. Preprint at bioRxiv https://doi.org/10.1101/2024.09.16.613342 (2024).
-
- Wenteler, A. et al. PertEval-scFM: benchmarking single-cell foundation models for perturbation effect prediction. Preprint at bioRxiv https://doi.org/10.1101/2024.10.02.616248 (2024).
Grants and funding
LinkOut - more resources
Full Text Sources
