Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 1;31(6):1195-1201.
doi: 10.1158/1055-9965.EPI-21-0876.

Accounting for Delayed Entry in Analyses of Overall Survival in Clinico-Genomic Databases

Affiliations

Accounting for Delayed Entry in Analyses of Overall Survival in Clinico-Genomic Databases

Daniel Backenroth et al. Cancer Epidemiol Biomarkers Prev. .

Abstract

Background: Clinico-genomic databases favor inclusion of long-term survivors, leading to potentially biased overall survival (OS) analyses. Risk set adjustments relying on the independent delayed entry assumption may mitigate this bias. We aimed to determine whether this assumption is satisfied in a dataset of patients with advanced non-small cell lung cancer (aNSCLC), and to give guidance for clinico-genomic OS analyses when the assumption is not satisfied.

Methods: We analyzed the association of timing of next-generation sequencing (NGS) testing with real-world OS (rwOS) in patient data from a United States-based nationwide longitudinal deidentified electronic health records-derived database. Estimates of rwOS using risk set adjustment were compared with estimates computed with respect to all patients, regardless of NGS testing.

Results: The independent delayed entry assumption was not satisfied in this database, and later sequencing had a negative association with the hazard of death after sequencing. In a model adjusted for relevant characteristics, each month delay in sequencing was associated with a 2% increase in the hazard of death. However, until the median survival time, estimates of OS using risk set adjustment are similar to estimates computed for all patients, regardless of NGS testing.

Conclusions: rwOS analyses in clinico-genomic databases should assess the independent delayed entry assumption. Comparisons versus broader population may be useful to evaluate the rwOS differences between calculations using risk set adjustment and patient cohorts where the bias relates to overrepresentation of long survivors.

Impact: This study illustrates practices that can increase the interpretability of findings from OS analyses in clinico-genomic databases.

PubMed Disclaimer

Figures

Figure 1. Baseline characteristics of all patients in the EHR (left) and all patients in the EHR with an NGS test date before 60 days after the advanced diagnosis date (right), as a function of the year of advanced diagnosis. 95% confidence intervals are included (shaded areas). Dx, diagnosis.
Figure 1.
Baseline characteristics of all patients in the EHR (left) and all patients in the EHR with an NGS test date before 60 days after the advanced diagnosis date (right), as a function of the year of advanced diagnosis. 95% confidence intervals are included (shaded areas). Dx, diagnosis.
Figure 2. Survival functions for patients with nonsquamous histology without a record of EGFR+ or ALK+ alterations. Survival functions are included for (i) all patients in the EHR (all); (ii) NGS-tested patients in the EHR, without accounting for delayed entry (naïve); (iii) NGS-tested patients in the EHR whose entry date is on or prior to the start of first-line therapy (pre-L1); and (iv) NGS-tested patients in the EHR, using risk set adjustment (RSA). A, Patients who received carboplatin and paclitaxel or pemetrexed (potentially also with bevacizumab) as their first-line therapy starting pre-2017, survival functions from start of first-line therapy. B, Survival functions for patients who received carboplatin, pembrolizumab, and pemetrexed as their first-line therapy starting post-2016.
Figure 2.
Survival functions for patients with nonsquamous histology without a record of EGFR+ or ALK+ alterations. Survival functions are included for (i) all patients in the EHR (all); (ii) NGS-tested patients in the EHR, without accounting for delayed entry (naïve); (iii) NGS-tested patients in the EHR whose entry date is on or prior to the start of first-line therapy (pre-L1); and (iv) NGS-tested patients in the EHR, using risk set adjustment (RSA). A, Patients who received carboplatin and paclitaxel or pemetrexed (potentially also with bevacizumab) as their first-line therapy starting pre-2017, survival functions from start of first-line therapy. B, Survival functions for patients who received carboplatin, pembrolizumab, and pemetrexed as their first-line therapy starting post-2016.
Figure 3. Estimates of the survival function using risk set adjustment from the simulation study, in which samples of 1,000 patients with 25%, 50%, and 75% of patients entering prior to the start of treatment were created. One hundred samples were created for each of the different percentages of patients entering prior to the index date, and the survival functions shown are the median survival functions across each of those samples. The survival function for the source population (with 48% of patients entering prior to the start of treatment) is essentially the same as that illustrated here for 50% of patients entering prior to the start of treatment.
Figure 3.
Estimates of the survival function using risk set adjustment from the simulation study, in which samples of 1,000 patients with 25%, 50%, and 75% of patients entering prior to the start of treatment were created. One hundred samples were created for each of the different percentages of patients entering prior to the index date, and the survival functions shown are the median survival functions across each of those samples. The survival function for the source population (with 48% of patients entering prior to the start of treatment) is essentially the same as that illustrated here for 50% of patients entering prior to the start of treatment.

References

    1. Singal G, Miller PG, Agarwala V, Li G, Kaushik G, Backenroth D, et al. . Association of patient characteristics and tumor genomics with clinical outcomes among patients with non–small cell lung cancer using a clinicogenomic database. JAMA 2019;321:1391–9. - PMC - PubMed
    1. Hyman DM, Solit DB, Arcila ME, Cheng DT, Sabbatini P, Baselga J, et al. . Precision medicine at Memorial Sloan Kettering Cancer Center: clinical next-generation sequencing enabling next-generation targeted therapy trials. Drug Discov Today 2015;20:1422–8. - PMC - PubMed
    1. Smyth LM, Zhou QC, Yu C, Lepisto E, Arnedos M, Hasset M, et al. . Use of AACR Project GENIE, a clinicogenomic registry, to define the natural history of AKT1E17KMutant ER+/HER2- metastatic breast cancer (MBC). In:Proceedings of the 110th Annual Meeting of the American Association for Cancer Research; 2019March 29 - April 3; Atlanta, GA. Philadelphia (PA): AACR; 2019.
    1. Goldberg P. ASCO forms collaboration with two Big Data firms to grow CancerLinQ. The Cancer LetterDecember 21, 2017, Issue47.
    1. Sridhara R. Using real-world data to generate potential synthetic control arms: the AACR Project GENIE Experience. In:Proceedings of the 110th Annual Meeting of the American Association for Cancer Research; 2019March 29 - April 3; Atlanta, GA. Philadelphia (PA): AACR; 2019.

LinkOut - more resources