Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul-Aug;59(4):738-757.
doi: 10.1080/00273171.2024.2317928. Epub 2024 Apr 8.

Network Inference With the Lasso

Affiliations

Network Inference With the Lasso

Lourens Waldorp et al. Multivariate Behav Res. 2024 Jul-Aug.

Abstract

Calculating confidence intervals and p-values of edges in networks is useful to decide their presence or absence and it is a natural way to quantify uncertainty. Since lasso estimation is often used to obtain edges in a network, and the underlying distribution of lasso estimates is discontinuous and has probability one at zero when the estimate is zero, obtaining p-values and confidence intervals is problematic. It is also not always desirable to use the lasso to select the edges because there are assumptions required for correct identification of network edges that may not be warranted for the data at hand. Here, we review three methods that either use a modified lasso estimate (desparsified or debiased lasso) or a method that uses the lasso for selection and then determines p-values without the lasso. We compare these three methods with popular methods to estimate Gaussian Graphical Models in simulations and conclude that the desparsified lasso and its bootstrapped version appear to be the best choices for selection and quantifying uncertainty with confidence intervals and p-values.

Keywords: bootstrap; debiased lasso; desparsified lasso; multisplit method; p-values for lasso.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(a) Example of a network with four nodes 1, 2, 3 and 4 with associated variables X1, X2, X3, and X4. The edges X1X2, X1X3 and X2X4 correspond to the conditional dependence statements, and the absence of edges correspond to conditional independence statements: X2 ⊥⊥X3|X1, X1⊥⊥X4|X2 and X3⊥⊥X4|X1,X2. (b) The variable X4 is not part of the neighborhood of X1, while X2 and X3 are part of the neighborhood of X1.
Figure 2.
Figure 2.
Top panel: bootstrapped sampling distributions of lasso estimates of partial correlations ρ=0,0.2,0.5. Bottom panel: the corresponding bootstrapped sampling distributions based on unbiased least squares estimates. The dashed vertical line indicates the true parameter.
Figure 3.
Figure 3.
Sensitivity (first row), precision (second row), and coverage (third row) for edge probabilities 0.2 (left column) and 0.4 (right column), as a function of sample size (x-axis). The dashed horizontal line at 0.95 in the last row indicates the desired coverage based on the chosen threshold α=0.05.
Figure 4.
Figure 4.
Significant partial correlations in the PTSD data; blue/red edges correspond to positive/negative partial correlations; the width of edges is a function of the absolute value of the associated parameter.
Figure 5.
Figure 5.
The point estimates and confidence intervals of the 50 partial correlations with the largest point estimate. Hollow points represent estimates whose CIs overlap with zero.
Figure I.1.
Figure I.1.
Specificity (first row), coverage for absent edges (second row), and coverage for present edges (third row) for edge probabilities 0.2 (left column) and 0.4 (right column), as a function of sample size (x-axis).

References

    1. Bertsimas, D., King, A., & Mazumder, R. (2016). Best subset selection via a modern optimization lens. The Annals of Statistics, 44(2), 813–852. 10.1214/15-AOS1388 - DOI
    1. Bilodeau, M., & Brenner, D. (1999). Theory of multivariate statistics. Springer-Verlag.
    1. Blanken, T. F., Borsboom, D., Penninx, B. W., & Van Someren, E. J. (2020). Network outcome analysis identifies difficulty initiating sleep as a primary target for prevention of depression: A 6-year prospective study. Sleep, 43(5), zsz288. 10.1093/sleep/zsz288 - DOI - PMC - PubMed
    1. Boccaletti, S., Latora, V., Moreno, Y., Chavez, M., & Hwang, D. (2006). Complex networks: Structure and dynamics. Physics Reports, 424(4-5), 175–308. 10.1016/j.physrep.2005.10.009 - DOI
    1. Borsboom, D., Cramer, A. O. J., Schmittmann, V. D., Epskamp, S., & Waldorp, L. J. (2011). The small world of psychopathology. PloS One, 6(11), e27407. 10.1371/journal.pone.0027407 - DOI - PMC - PubMed

LinkOut - more resources