Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jun 15;30(12):i139-48.
doi: 10.1093/bioinformatics/btu293.

Graph-regularized dual Lasso for robust eQTL mapping

Affiliations

Graph-regularized dual Lasso for robust eQTL mapping

Wei Cheng et al. Bioinformatics. .

Abstract

Motivation: As a promising tool for dissecting the genetic basis of complex traits, expression quantitative trait loci (eQTL) mapping has attracted increasing research interest. An important issue in eQTL mapping is how to effectively integrate networks representing interactions among genetic markers and genes. Recently, several Lasso-based methods have been proposed to leverage such network information. Despite their success, existing methods have three common limitations: (i) a preprocessing step is usually needed to cluster the networks; (ii) the incompleteness of the networks and the noise in them are not considered; (iii) other available information, such as location of genetic markers and pathway information are not integrated.

Results: To address the limitations of the existing methods, we propose Graph-regularized Dual Lasso (GDL), a robust approach for eQTL mapping. GDL integrates the correlation structures among genetic markers and traits simultaneously. It also takes into account the incompleteness of the networks and is robust to the noise. GDL utilizes graph-based regularizers to model the prior networks and does not require an explicit clustering step. Moreover, it enables further refinement of the partial and noisy networks. We further generalize GDL to incorporate the location of genetic makers and gene-pathway information. We perform extensive experimental evaluations using both simulated and real datasets. Experimental results demonstrate that the proposed methods can effectively integrate various available priori knowledge and significantly outperform the state-of-the-art eQTL mapping methods.

Availability: Software for both C++ version and Matlab version is available at http://www.cs.unc.edu/∼weicheng/.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Examples of prior knowledge on genetic-interaction network S and gene–gene interactions represented by PPI network or gene co-expression network G. W is the regression coefficients to be learned
Fig. 2.
Fig. 2.
Ground truth of matrix W and that estimated by different methods. The x-axis represents traits and y-axis represents SNPs. Normalized absolute values of regression coefficients are used. Darker color implies stronger association
Fig. 3.
Fig. 3.
The ground truth networks, prior partial networks and the refined networks
Fig. 4.
Fig. 4.
Power curves for synthetic data. The left plots show the ROC curve, where our model GDL achieved maximum power. The black solid line denotes what random guessing would have achieved. The right plots illustrate the areas under the precision-recall curve (AUCs) of different methods
Fig. 5.
Fig. 5.
The areas under the TPR-FPR curve (AUCs) of Lasso, LORS, G-Lasso and GDL. In each panel, we vary the percentage of noises in the prior networks S0 and G0
Fig. 6.
Fig. 6.
The top-1000 significant associations identified by different methods. In each plot, the x-axis represents SNPs and y-axis represents genes. Both SNPs and genes are arranged by their locations in the genome
Fig. 7.
Fig. 7.
Ratio of correct interactions refined when varying κ. The initial input networks only contain 25% correct interactions

References

    1. Biganzoli EM, et al. Artificial neural network for the joint modelling of discrete cause-specific hazards. Artif. Intell. Med. 2006;37:119–130. - PubMed
    1. Bochner BR. New technologies to assess genotype henotype relationships. Nat. Rev. Genet. 2003;4:309–314. - PubMed
    1. Boyd S, Vandenberghe L. Convex Optimization. Cambridge: Cambridge University Press; 2004.
    1. Brem RB, et al. Genetic interactions between polymorphisms that affect gene expression in yeast. Nature. 2005;436:701–703. - PMC - PubMed
    1. Brem YG, et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat. Genet. 2003;35:57–64. - PubMed

Publication types

Substances