Randomization-Based Statistical Inference: A Resampling and Simulation Infrastructure
- PMID: 30270947
- PMCID: PMC6155997
- DOI: 10.1111/test.12156
Abstract
Statistical inference involves drawing scientifically based conclusions about natural processes or observable phenomena from datasets with intrinsic random variation. Both parametric and non-parametric approaches exist for studying data and sampling distributions, yet few resources provide an integrated view of data (observed or simulated), theoretical concepts, computational mechanisms, and hands-on use through flexible graphical user interfaces. We designed, implemented and validated a new portable randomization-based statistical inference infrastructure (http://socr.umich.edu/HTML5/Resampling_Webapp) that blends research-driven data analytics with interactive learning and provides a backend computational library for managing large amounts of simulated or user-provided data. The core of this framework is a modern randomization webapp, which can be invoked on any device with a JavaScript-enabled web browser. We demonstrate the use of these resources to analyze proportions, means, and other statistics using simulated (virtual experiments) and observed (e.g., Acute Myocardial Infarction, Job Rankings) data. Finally, we draw parallels between parametric inference methods and their distribution-free alternatives. The Randomization and Resampling webapp can be used for data analytics, as well as for formal (in-class) and informal (out-of-classroom) learning and teaching of scientific concepts such as sampling, random variation, computational statistical inference and data-driven analytics. The entire scientific community may use, test, expand, modify or embed these resources (data, source code, learning activities, webapp) without any restrictions.
Keywords: Statistics Online Computational Resource (SOCR); bootstrapping; randomization; resampling; simulation; statistical inference.
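To make the abstract's two core ideas concrete, the following is a minimal sketch of a randomization (permutation) test for a difference in means and a percentile bootstrap interval for a generic statistic. It is not the webapp's implementation: the function names `permutation_test` and `bootstrap_ci`, their parameters, and the choice of Python with only the standard library are assumptions made for illustration.

```python
import random
from statistics import mean


def permutation_test(group_a, group_b, n_resamples=10_000, seed=0):
    """Two-sided randomization test for a difference in group means.

    Hypothetical helper for illustration; not part of the SOCR library.
    Returns the observed difference and a Monte Carlo p-value estimate.
    """
    rng = random.Random(seed)
    observed = mean(group_a) - mean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    extreme = 0
    for _ in range(n_resamples):
        # Under the null hypothesis the group labels are exchangeable,
        # so reshuffling the pooled data simulates the null distribution.
        rng.shuffle(pooled)
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        if abs(diff) >= abs(observed):
            extreme += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    p_value = (extreme + 1) / (n_resamples + 1)
    return observed, p_value


def bootstrap_ci(sample, statistic=mean, n_resamples=10_000, level=0.95, seed=0):
    """Percentile bootstrap confidence interval for `statistic`.

    Hypothetical helper for illustration; resamples with replacement.
    """
    rng = random.Random(seed)
    n = len(sample)
    stats = sorted(
        statistic(rng.choices(sample, k=n)) for _ in range(n_resamples)
    )
    lo_idx = int((1 - level) / 2 * n_resamples)
    hi_idx = int((1 + level) / 2 * n_resamples) - 1
    return stats[lo_idx], stats[hi_idx]


if __name__ == "__main__":
    # Made-up numbers, used only to show the calling convention.
    treatment = [12.1, 14.3, 11.8, 15.2, 13.9, 14.8]
    control = [10.4, 11.1, 12.0, 9.8, 10.9, 11.5]
    print(permutation_test(treatment, control, n_resamples=2_000))

    # A proportion is the mean of 0/1 outcomes, so the same helper applies.
    outcomes = [1] * 30 + [0] * 70
    print(bootstrap_ci(outcomes, n_resamples=2_000))
```

Both helpers take a `seed` so that a simulation can be repeated exactly, which is convenient in a teaching setting. The permutation test makes no distributional assumption beyond exchangeability of the group labels under the null hypothesis, which is what makes it a distribution-free counterpart to the two-sample t-test.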