Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 15;41(6):964-980.
doi: 10.1002/sim.9298. Epub 2022 Jan 10.

B-value and empirical equivalence bound: A new procedure of hypothesis testing

Affiliations

B-value and empirical equivalence bound: A new procedure of hypothesis testing

Yi Zhao et al. Stat Med. .

Abstract

In this study, we propose a two-stage procedure for hypothesis testing, where the first stage is conventional hypothesis testing and the second is an equivalence testing procedure using an introduced empirical equivalence bound (EEB). In 2016, the American Statistical Association released a policy statement on P-values to clarify the proper use and interpretation in response to the criticism of reproducibility and replicability in scientific findings. A recent solution to improve reproducibility and transparency in statistical hypothesis testing is to integrate P-values (or confidence intervals) with practical or scientific significance. Similar ideas have been proposed via the equivalence test, where the goal is to infer equality under a presumption (null) of inequality of parameters. However, the definition of scientific significance/equivalence can sometimes be ill-justified and subjective. To circumvent this drawback, we introduce the B-value and the EEB, which are both estimated from the data. Performing a second-stage equivalence test, our procedure offers an opportunity to improve the reproducibility of findings across studies.

Keywords: empirical equivalence bound; equivalence test; hypothesis testing.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Conclusion of an equivalence test with different prespecified equivalence bounds when (a) the confidence interval covers zero and (b) the confidence interval does not cover zero. Two possible conclusions under scenario (a): (i) inconclusive result and (ii) equivalence. Three possible conclusions under scenario (b): (i) significant difference, (ii) inconclusive result, and (iii) equivalence.
FIGURE 2
FIGURE 2
A two-stage testing procedure comparing two means.
FIGURE 3
FIGURE 3
Marginal and conditional distribution of the B-value and the corresponding EEB at various β levels in the example.
FIGURE 4
FIGURE 4
(a) The marginal and conditional distribution of the B-value and (b) the marginal and conditional EEB at various β levels in the example.
FIGURE 5
FIGURE 5
Implementation of the two-stage testing procedure on the example data with α = 0.05 and β = 0.95. The black solid lines in Step 1 are the 95% confidence intervals, the black solid lines in Step 2 are the 90% confidence intervals, and the gray dashed lines are the equivalence intervals with margin level at EEBα(β | C).
FIGURE 6
FIGURE 6
(a)&(c) Data scatter plot and the 95% confidence interval (CI) of each species. (b)&(d) The conditional empirical equivalence interval with β = 0.95 and the 90% CI of each pair-wise comparison.
FIGURE 7
FIGURE 7
(a) Scatter plot and 95% confidence interval (CI) of the performance data after adjusting for age, sex, and motor coordination. (b) The conditional empirical equivalence interval with β = 0.95 and the 90% CI of the comparison at each repetition.

References

    1. Walker E, Nowacki AS. Understanding equivalence and noninferiority testing. Journal of general internal medicine 2011; 26(2): 192–196. - PMC - PubMed
    1. Hoenig JM, Heisey DM. The abuse of power: the pervasive fallacy of power calculations for data analysis. The American Statistician 2001; 55(1): 19–24.
    1. Westlake WJ. Use of confidence intervals in analysis of comparative bioavailability trials. Journal of Pharmaceutical Sciences 1972; 61(8): 1340–1341. - PubMed
    1. Westlake WJ. Symmetrical confidence intervals for bioequivalence trials. Biometrics 1976: 741–744. - PubMed
    1. Anderson S, Hauck WW. A new procedure for testing equivalence in comparative bioavailability and other clinical trials. Communications in Statistics-Theory and Methods 1983; 12(23): 2663–2692.

Publication types

LinkOut - more resources