Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep 15;26(18):i517-23.
doi: 10.1093/bioinformatics/btq377.

Discovering graphical Granger causality using the truncating lasso penalty

Affiliations

Discovering graphical Granger causality using the truncating lasso penalty

Ali Shojaie et al. Bioinformatics. .

Abstract

Motivation: Components of biological systems interact with each other in order to carry out vital cell functions. Such information can be used to improve estimation and inference, and to obtain better insights into the underlying cellular mechanisms. Discovering regulatory interactions among genes is therefore an important problem in systems biology. Whole-genome expression data over time provides an opportunity to determine how the expression levels of genes are affected by changes in transcription levels of other genes, and can therefore be used to discover regulatory interactions among genes.

Results: In this article, we propose a novel penalization method, called truncating lasso, for estimation of causal relationships from time-course gene expression data. The proposed penalty can correctly determine the order of the underlying time series, and improves the performance of the lasso-type estimators. Moreover, the resulting estimate provides information on the time lag between activation of transcription factors and their effects on regulated genes. We provide an efficient algorithm for estimation of model parameters, and show that the proposed method can consistently discover causal relationships in the large p, small n setting. The performance of the proposed model is evaluated favorably in simulated, as well as real, data examples.

Availability: The proposed truncating lasso method is implemented in the R-package 'grangerTlasso' and is freely available at http://www.stat.lsa.umich.edu/~shojaie/.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Mean and standard deviation of performance criteria for lasso, Alasso, Tlasso and TAlasso in estimation of graphical Granger model, with p = 100, n = 50 and d = 2. Top: T = 10, Bottom: T = 20.
Fig. 2.
Fig. 2.
Images of the adjacency matrix of the true graph, and estimates from lasso, Alasso, Tlasso and TAlasso. Images on the left correspond to the adjacency matrices of graphical Granger models (true and estimates) over time, while images on the right represent the cumulative graphical model (the network structure). In the left panel of the true adjacency matrix, a dark pixel in the (i, j)th entry at time t represents an edge from XTtj to XTi. The gray-scale images for the estimates represent percentage of times where an edge is present in 50 simulations. Significant false positives and negatives are marked with rectangles and ovals, respectively.
Fig. 3.
Fig. 3.
Known transcription regulatory network of E.coli along with estimates based on Alasso, TAlasso and grpLasso. True edges (true positives in estimated networks) are marked with solid black arrows, while false positives are indicated by dashed red arrows.
Fig. 4.
Fig. 4.
Known BioGRID network of human HeLa cell genes along with the estimates based on TAlasso, grpLasso and CNET. True edges (true positives in estimated networks) are marked with solid black arrows, while false positives are indicated by dashed red arrows.

References

    1. Arnold A, et al. Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. New York, NY, USA: ACM; 2007. Temporal causal modeling with graphical granger methods; pp. 66–75.
    1. de Leeuw J. Block-relaxation algorithms in statistics. In: Bock HH, et al., editors. Information System and Data Analysis. 1994. pp. 308–325.
    1. Friedman J, et al. Regularization paths for generalized linear models via coordinate descent. J. Stat. Soft. 2008;33 - PMC - PubMed
    1. Fujita A, et al. Modeling gene expression regulatory networks with the sparse vector autoregressive model. BMC Systems Biol. 2007;1:39. - PMC - PubMed
    1. Granger C. Investigating causal relations by econometric models and cross-spectral methods. Econometrica. 1969;37:424–438.

Publication types