[Preprint]. 2025 Mar 14:2023.11.27.23299062.
doi: 10.1101/2023.11.27.23299062.

Proteome-wide model for human disease genetics

Rose Orenbuch et al. medRxiv.

Update in

  • Proteome-wide model for human disease genetics.
    Orenbuch R, Shearer CA, Kollasch AW, Spinner AD, Hopf T, van Niekerk L, Franceschi D, Dias M, Frazer J, Marks DS. Nat Genet. 2025 Dec;57(12):3165-3174. doi: 10.1038/s41588-025-02400-1. Epub 2025 Nov 24. PMID: 41286104.

Abstract

Identifying variants driving disease accelerates both genetic diagnosis and therapeutic development, but missense variants still present a bottleneck as their effects are less straightforward than truncations or nonsense mutations. While computational prediction methods are sufficiently accurate to be of clinical value for variants in known disease genes, they do not generalize well to other genes as the scores are not calibrated across the proteome [1-6]. To address this, we developed a deep generative model, popEVE, that combines evolutionary information with population sequence data [7] and achieves state-of-the-art performance on a suite of proteome-wide prediction tasks, without overestimating the prevalence of deleterious variants in the population. popEVE identifies 442 genes in a developmental disorder cohort [8], including evidence of 123 novel candidates, many without the need for cohort-wide enrichment. Candidate genes are functionally similar to known developmental disorder genes, and case variants tend to fall in functionally important regions of these genes. Finally, we show that these findings can be reproduced from analysis of the patient exomes alone, demonstrating that popEVE provides a new avenue for genetic analysis in situations where traditional methods fail, including genetic diagnosis of rare-as-one diseases, even in the absence of parent sequencing.
