Biostatistics. 2018 Jul 1;19(3):281-294.
doi: 10.1093/biostatistics/kxx039.

Rank-based two-sample tests for paired data with missing values

Youyi Fong et al. Biostatistics. 2018.

Abstract

The two-sample location problem is one of the most frequently encountered problems in statistical practice. The two most commonly studied subtypes involve observations from two populations that are either independent or completely paired, but a third subtype often occurs in practice when some observations are paired and some are not. Partially paired two-sample problems, also known as paired two-sample problems with missing data, often arise in biomedical fields when invasive procedures make it difficult to collect data from an individual under both of the conditions being compared. Existing rank-based two-sample comparison procedures for partially paired data, however, do not make efficient use of all available data. To improve the power of testing procedures for this problem, we propose several new rank-based test statistics and study their asymptotic distributions and, when necessary, exact variances. Through extensive numerical studies, we show that the best overall power comes from the proposed tests based on weighted linear combinations of the test statistics comparing paired data and the test statistics comparing independent data, using weights inversely proportional to their variances. We illustrate the proposed methods with a real data example from HIV prevention research.
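
The idea of combining a paired-data statistic and an independent-data statistic with inverse-variance weights can be illustrated with a minimal sketch. This is not the authors' test statistic: it is an assumed, simplified construction that applies a Wilcoxon signed-rank statistic to the complete pairs only, applies a Mann-Whitney statistic to the unpaired observations only (so the two components are independent), rescales both to have null mean 1/2, and uses the standard no-ties null variances with a normal approximation. The function names `ranks` and `combined_rank_test` are hypothetical.

```python
import math


def ranks(values):
    """Return ranks 1..n of the values (assumes no ties)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        r[i] = rank
    return r


def combined_rank_test(pairs, x_only, y_only):
    """Inverse-variance weighted combination of two rank statistics.

    pairs  : list of (x, y) tuples observed under both conditions
    x_only : observations available only under condition X
    y_only : observations available only under condition Y
    Assumes continuous data: no ties and no zero differences.
    Returns (z, two_sided_p) from a normal approximation.
    """
    # Paired component: Wilcoxon signed-rank statistic on d = y - x,
    # rescaled to [0, 1] so that its null mean is 1/2.
    d = [y - x for x, y in pairs]
    n = len(d)
    r = ranks([abs(v) for v in d])
    w_plus = sum(ri for ri, di in zip(r, d) if di > 0)
    t_pair = w_plus / (n * (n + 1) / 2)
    var_pair = (2 * n + 1) / (6 * n * (n + 1))  # null variance, no ties

    # Independent component: Mann-Whitney U (count of y > x),
    # rescaled to [0, 1] so that its null mean is 1/2.
    m1, m2 = len(x_only), len(y_only)
    u = sum(1 for x in x_only for y in y_only if y > x)
    t_ind = u / (m1 * m2)
    var_ind = (m1 + m2 + 1) / (12 * m1 * m2)  # null variance, no ties

    # Weights inversely proportional to the null variances.
    w_pair, w_ind = 1 / var_pair, 1 / var_ind
    combined = (w_pair * (t_pair - 0.5) + w_ind * (t_ind - 0.5)) / (w_pair + w_ind)
    var_combined = 1 / (w_pair + w_ind)  # valid because components are independent

    z = combined / math.sqrt(var_combined)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p-value
    return z, p


# Made-up illustrative data (not from the paper).
z, p = combined_rank_test(
    pairs=[(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)],
    x_only=[0.5, 1.5],
    y_only=[5.0, 7.0],
)
print(z, p)
```

With samples this small the normal approximation is rough; the sketch is meant only to show how the weights and the combined variance are formed. Handling ties, or letting unpaired observations also inform the paired comparison, would require different variance formulas.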

Figures

Fig. 1.
Boxplots of explant infectivity data. Paired observations between inner and outer foreskin samples are connected with lines. Unpaired observations are represented by shaded circles.

Comment in

  • Letter to the editor.
    Guo X, Gao Y, Niu C, Zhang S. Biostatistics. 2019 Apr 1;20(2):358-362. doi: 10.1093/biostatistics/kxy047. PMID: 30165542. No abstract available.
  • Response to Guo et al.'s Letter to the Editor.
    Fong Y, Huang Y, Lemos MP, Mcelrath MJ. Biostatistics. 2019 Apr 1;20(2):363-365. doi: 10.1093/biostatistics/kxy061. PMID: 30590447. Free PMC article. No abstract available.
