Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct 27;14(1):6840.
doi: 10.1038/s41467-023-42354-8.

Reconstructing disease dynamics for mechanistic insights and clinical benefit

Affiliations

Reconstructing disease dynamics for mechanistic insights and clinical benefit

Amit Frishberg et al. Nat Commun. .

Abstract

Diseases change over time, both phenotypically and in their underlying molecular processes. Though understanding disease progression dynamics is critical for diagnostics and treatment, capturing these dynamics is difficult due to their complexity and the high heterogeneity in disease development between individuals. We present TimeAx, an algorithm which builds a comparative framework for capturing disease dynamics using high-dimensional, short time-series data. We demonstrate the utility of TimeAx by studying disease progression dynamics for multiple diseases and data types. Notably, for urothelial bladder cancer tumorigenesis, we identify a stromal pro-invasion point on the disease progression axis, characterized by massive immune cell infiltration to the tumor microenvironment and increased mortality. Moreover, the continuous TimeAx model differentiates between early and late tumors within the same tumor subtype, uncovering molecular transitions and potential targetable pathways. Overall, we present a powerful approach for studying disease progression dynamics-providing improved molecular interpretability and clinical benefits for patient stratification and outcome prediction.

PubMed Disclaimer

Conflict of interest statement

S.S.O. holds equity and is a consultant of CytoReason. A.F., E.B., and K.R.B. are employees and hold equity in CytoReason. F.J.T. reports receiving consulting fees from ImmunAI and CytoReason and ownership interest in Dermagnostix. S.P. receives speaker and consultant honoraria from and has served on advisory boards for Abbott, Alcon, Geuder, Oculus, Schwind, STAAR, TearLab, Thieme Compliance, Ziemer, Zeiss and research funding from Abbott, Alcon, Hoya, Oculentis, Oculus, Schwind and Zeiss. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. TimeAx discovers the shared disease dynamics across multiple patients.
A Disease progression is not captured by patient subtyping. While current patient stratification and disease state comparison requires clustering of patients into coarse subgroups (top), understanding disease dynamics as a common ground between patients’ disease trajectories enables patient stratification in a higher resolution (bottom). B TimeAx seed features selection process. TimeAx selects a “conserved-dynamics-seed” with shared dynamics across all subjects. In the heatmap (left), each row is a patient and each column is a sample collected at a different time. Each feature spans multiple patients (multiple rows) and is also presented as a line plot (right). C An illustration describing TimeAx’s utility for revealing the shared disease progression dynamics and the projection of patients-specific disease pseudotime positions, integrating the differences between patients’ disease trajectories over time. D Disease pseudotime can be utilized to discover novel disease mechanisms as well as to support new clinical frameworks for patient stratification and outcome prediction. We thank Yuval Abraham for his contribution in the design and creation of (A, C and D).
Fig. 2
Fig. 2. Disease pseudotime captures disease progression dynamics better than chronological time.
A An illustration of the influenza infection dynamics TimeAx modeling and disease pseudotime inference, based on the longitudinal influenza cohort. B TimaAx disease pseudotime (y-axis) is different from chronological time (time from infection, x-axis). Shown are the differences between symptomatic (green lines) and asymptomatic (red lines) patients from the longitudinal influenza cohort. P value was calculated by comparing the prediction of disease pseudotime (by ANOVA), using only time or the interaction between time and symptoms as predictors. Trend lines represent the average levels in each of the groups ± standard error. C Gene associations (−log10 transformed, FDR-corrected, Q values based on linear regression), across all symptomatic and asymptomatic patients, using either sampling time (x-axis) or disease pseudotime (y-axis), and applying a Q value threshold of 10−5 (Dashed lines) (See Methods). Genes are colored based on their association with the two time axes. D An illustration of the UBC dynamics TimeAx modeling and disease pseudotime inference, based on the UBC longitudinal cohort. E TimeAx disease pseudotime (y-axis) is different from chronological time (time from primary tumor, x-axis), exemplified in six patients in the UBC longitudinal cohort. F Gene associations (−log10 transformed, P values based on linear regression) with sampling time (x-axis) and disease pseudotime (y-axis), using a P value threshold of 10−2 (Dashed lines) (See Methods). Genes are colored based on their association with the two time axes, displaying significant associations almost entirely only with the disease pseudotime. G An illustration of the AMD dynamics TimeAx modeling and disease pseudotime inference, based on segmented features extracted from OCT scans of the patients’ retina. Optical coherence tomography (OCT). H TimeAx disease pseudotime (y-axis) is different from sampling time (time from first encounter, x-axis), exemplified in five patients in the AMD train cohort. I The distribution of AMD test cohort patients’ disease pseudotime positions (top; n = 29205 biologically independent samples) and times from diagnosis (bottom; n = 11075 biologically independent samples) (y-axis), across different visual severity states, determined according to the patients’ visual acuity levels (logMAR; x-axis). Boxes represent the 25th, 50th, and 75th percentiles; whiskers show maxima and minima. P values were calculated based on linear regression. We thank Yuval Abraham for his contribution in the design and creation of (A, D and G).
Fig. 3
Fig. 3. TimeAx uncovers an advanced tumor state with unfavorable clinical outcomes.
A Disease pseudotime clinical applicability. Shown are disease pseudotime (y-axis) relation with tumor stage (x-axis) for samples within the UBC longitudinal cohort. P value was calculated based on linear regression. B Tumor purity scores (y-axis) along the disease pseudotime (left) and the time from primary tumor (right) (x-axis), displaying a sharp decrease in high disease pseudotime positions (SPIP; dashed line). Disease pseudotime ranges pre and post the SPIP are marked by colored bars. C Cell type deconvolved cell contributions (y-axis), displaying major differences between pre- and post- SPIP samples. * p < 0.05, ** p < 0.01 based on a two-sided t-test (p < 10−4, 0.003, 0.05, 0.05, 10−4; Urothelial cells, Macrophages, naive T cells, memory T cells and Fibroblasts, respectively). D Survival plot for UBC patients within the TCGA test cohort, comparing patients’ tumors with disease pseudotime positions lower and higher the SPIP (pre versus post, respectively; color coded). P value was calculated based on a log-rank test. E Survived (black) versus deceased (white) percentages of basal/squamous patients within the TCGA test cohort pre and post SPIP (x-axis). p value was calculated using Fisher’s exact test. In (A and C), boxes represent the 25th, 50th, and 75th percentiles; whiskers show maxima and minima and n = 84 biologically independent samples. Stromal pro-invasion point (SPIP).
Fig. 4
Fig. 4. Disease pseudotime captures variation undetectable by current stratification frameworks.
A Disease pseudotime distribution across tumor molecular classifications in patients within the UBC longitudinal cohort based on the ‘LundTax’ molecular subtyping framework. B Distribution of ‘LundTax’ tumor molecular classifications for pre (red) and post (blue) stromal pro-invasion samples within the UBC longitudinal cohort. C Comparison of disease pseudotime (y-axis) between primary and recurrent tumors in pre-SPIP samples within the UBC longitudinal cohort. D Disease pseudotime distribution across tumor molecular classifications in patients pre-SPIP within the UBC longitudinal cohort based on the ‘LundTax’ molecular subtyping framework. E Percent of surviving patients in the TCGA test cohort with UroA tumors (y-axis) across disease pseudotime bins (bin size = 0.1; x-axis) pre-SPIP. p value was calculated based on linear regression. In (A, C and D), boxes represent the 25th, 50th, and 75th percentiles; whiskers show maxima and minima and n = 84 biologically independent samples.
Fig. 5
Fig. 5. Disease pseudotime uncovers molecular mechanisms promoting UBC progression.
A Co-expression matrix for genes (rows, columns) differentially expressed (q < 0.05) between early and late UroA tumors. Two gene sets were discovered and are highlighted in green and purple. B High enrichment of different pseudogene families within the downregulated module, compared to the upregulated module set and a random gene set (color-coded). C Pathway enrichment for the downregulated module, including pathway enrichment scores (left) and a heatmap of the expression levels of these pathways genes (columns), in early and late UroA tumors (rows) within the UBC longitudinal cohort. D Pathway enrichment scores for the upregulated module. E An illustration of the molecular model of UBC tumorigenesis, discovered by the TimeAx-based analysis, based on the increasing gene module. The illustration includes the association of known hallmarks of cancer with disease pseudotime (top) and the suggested mitotic kinetochore- spindle microtubules (MT) interaction (middle), which is also presented as boxplots of differential expression of its subunits (bottom; n = 40 biologically independent samples, boxes represent the 25th, 50th, and 75th percentiles; whiskers show maxima and minima). G protein-coupled receptors (GPCRs). We thank Yuval Abraham for his contribution in the design and creation of (E).

References

    1. Velten B, et al. Identifying temporal and spatial patterns of variation from multimodal data using MEFISTO. Nat. Methods. 2022;19:179–186. doi: 10.1038/s41592-021-01343-9. - DOI - PMC - PubMed
    1. Coifman RR, et al. Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps. Proc. Natl Acad. Sci. USA. 2005;102:7426–7431. doi: 10.1073/pnas.0500334102. - DOI - PMC - PubMed
    1. Chari, T., Banerjee, J. & Pachter, L. The Specious Art of Single-Cell Genomics. BioRxiv10.1101/2021.08.25.457696 (2021).
    1. Fischer DS, Theis FJ, Yosef N. Impulse model-based differential expression analysis of time course sequencing data. Nucleic Acids Res. 2018;46:e119. - PMC - PubMed
    1. Spies D, Renz PF, Beyer TA, Ciaudo C. Comparative analysis of differential gene expression tools for RNA sequencing time course data. Brief. Bioinforma. 2019;20:288–298. doi: 10.1093/bib/bbx115. - DOI - PMC - PubMed

Publication types

MeSH terms