Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2026 Jan 14;32(1):8.
doi: 10.1007/s10985-025-09676-9.

Beyond Bonferroni: new multiple contrast tests for time-to-event data under non-proportional hazards

Affiliations

Beyond Bonferroni: new multiple contrast tests for time-to-event data under non-proportional hazards

Ina Dormuth et al. Lifetime Data Anal. .

Abstract

When comparing multiple groups in clinical trials, we are not only interested in whether there is a difference between any groups but rather where the difference is. Such research questions lead to testing multiple individual hypotheses. To control the familywise error rate (FWER), we must apply some corrections or introduce tests that control the FWER by design. In the case of time-to-event data, a Bonferroni-corrected log-rank test is commonly used. This approach has two significant drawbacks: (i) it loses power when the proportional hazards assumption is violated and (ii) the correction generally leads to a lower power, especially when the test statistics are not independent. We propose two new tests based on combined weighted log-rank tests. One is a simple multiple contrast test of weighted log-rank tests, and one is an extension of the so-called CASANOVA test. The latter was introduced for factorial designs. We propose a new multiple contrast test based on the CASANOVA approach. Our test shows promise of being more powerful under crossing hazards and eliminates the need for additional p-value correction. We assess the performance of our tests through extensive Monte Carlo simulation studies covering both proportional and non-proportional hazard scenarios. Finally, we apply the new and reference methods to a real-world data example. The new approaches control the FWER and show reasonable power in all scenarios. They outperform the adjusted approaches in some non-proportional settings in terms of power.

Keywords: Multiple contrast tests; Non-proportional hazards; Survival analysis; Weighted log-rank test.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
FWER under formula image for all settings for the Dunnett-type (left) and Tukey-type (right) contrast matrices. The dashed lines represent the borders of the binomial precision interval formula image
Fig. 2
Fig. 2
Local power over all tests under the alternative for Dunnett-type contrasts for all four scenarios (each boxplot contains 1136 data points)
Fig. 3
Fig. 3
Local power over all tests under the alternative for Tukey-type contrasts for all four scenarios (each boxplot contains 2016 data points)
Fig. 4
Fig. 4
Kaplan–Meier plot of the seven treatment groups of patients with multiple myeloma (MM)

References

    1. Andersen PK, Borgan O, Gill RD, Keiding N (1993) Statistical models based on counting processes. Springer, Berlin
    1. Blanche P, Dartigues JF, Riou J (2022) A closed max-t test for multiple comparisons of areas under the ROC curve. Biometrics 78(1):352–363 - DOI - PubMed
    1. Bluhmki T, Dobler D, Beyersmann J, Pauly M (2019) The wild bootstrap for multivariate Nelson–Aalen estimators. Lifetime Data Anal 25:97–127 - DOI - PMC - PubMed
    1. Bluhmki T, Schmoor C, Dobler D, Pauly M, Finke J, Schumacher M, Beyersmann J (2018) A wild bootstrap approach for the Aalen–Johansen estimator. Biometrics 74(3):977–985 - DOI - PubMed
    1. Brendel M, Janssen A, Mayer CD, Pauly M (2014) Weighted Logrank permutation tests for randomly right censored life science data. Scand J Stat 41(3):742–761. 10.1111/sjos.12059 - DOI

LinkOut - more resources