Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 12;15(1):5158.
doi: 10.1038/s41598-025-89204-9.

Instrumental variable estimation for compositional treatments

Affiliations

Instrumental variable estimation for compositional treatments

Elisabeth Ailer et al. Sci Rep. .

Abstract

Many scientific datasets are compositional in nature. Important biological examples include species abundances in ecology, cell-type compositions derived from single-cell sequencing data, and amplicon abundance data in microbiome research. Here, we provide a causal view on compositional data in an instrumental variable setting where the composition acts as the cause. First, we crisply articulate potential pitfalls for practitioners regarding the interpretation of compositional causes from the viewpoint of interventions and warn against attributing causal meaning to common summary statistics such as diversity indices in microbiome data analysis. We then advocate for and develop multivariate methods using statistical data transformations and regression techniques that take the special structure of the compositional sample space into account while still yielding scientifically interpretable results. In a comparative analysis on synthetic and real microbiome data we show the advantages and limitations of our proposal. We posit that our analysis provides a useful framework and guidance for valid and informative cause-effect estimation in the context of compositional data.

Keywords: Causality; Cause-effect estimation; Compositional data; Instrumental variable; Microbial diversity.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: No competing interest is declared.

Figures

Fig. 1
Fig. 1
The ternary plot shows an exemplary scenario with formula image. The orange contour contains compositions for which the Simpson diversity is constant, while the blue contour shows compositions for which the Shannon diversity is constant. Shannon diversity changes along contours of Simpson diversity and vice versa.
Fig. 2
Fig. 2
Cause-effect estimation of formula image via an instrumental variable Z for compositional X.
Fig. 3
Fig. 3
Visualization of a Setting A (formula image): The left panel shows both a weak (left) and a strong (right) instrument, i.e. with the instrument value Z either barely influencing the the composition of the microbiome (left) or strongly impacting the composition of the microbiome (right). The right panel shows a discrepancy between the true causal effect and the observed effect which stems from a confounding factor.
Fig. 4
Fig. 4
Boxplots of the results for setting B in Table 2 with formula image. We show OOS MSE (left), recovery of non-zero formula image coefficients (middle), and recovery of zero formula image coefficients.
Fig. 5
Fig. 5
We display OOS MSE and formula image-MSE (for non-zero coefficients and where applicable) for our robustness checks. All results and further visualizations are in Supplementary Material S8.
Fig. 6
Fig. 6
Taxonomic tree of the microbiome data at genus level. The influential log-ratios for both Only Second LC and ILR+LC are highlighted in black and blue, respectively.

References

    1. Aitchison, J. The statistical analysis of compositional data. J. Roy. Stat. Soc.: Ser. B (Methodol.)44, 139–160 (1982).
    1. Blei, D. M. & Lafferty, J. D. Correlated topic models. Advances in Neural Information Processing Systems 147–154 (2005).
    1. Blei, D. M. & Lafferty, J. D. A correlated topic model of Science. Ann. Appl. Stat.1, 17–35. 10.1214/07-aoas114 (2007) arXiv:0708.3601v2.
    1. Rozenblatt-Rosen, O., Stubbington, M. J., Regev, A. & Teichmann, S. A. The human cell atlas: From vision to reality. Nat. News550, 451 (2017). - PubMed
    1. Turnbaugh, P. J. et al. The human microbiome project. Nature449, 804–810 (2007). - PMC - PubMed

Publication types

LinkOut - more resources