Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov 13;4(11):100671.
doi: 10.1016/j.xgen.2024.100671. Epub 2024 Oct 11.

Gene regulatory network inference from CRISPR perturbations in primary CD4+ T cells elucidates the genomic basis of immune disease

Affiliations

Gene regulatory network inference from CRISPR perturbations in primary CD4+ T cells elucidates the genomic basis of immune disease

Joshua S Weinstock et al. Cell Genom. .

Abstract

The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation has been challenging due to small effects, but experimental perturbations offer a complementary approach. Using CRISPR, we knocked out 84 genes in primary CD4+ T cells, targeting inborn error of immunity (IEI) disease transcription factors (TFs) and TFs without immune disease association. We developed a novel gene network inference method called linear latent causal Bayes (LLCB) to estimate the network from perturbation data and observed 211 regulatory connections between genes. We characterized programs affected by the TFs, which we associated with immune genome-wide association study (GWAS) genes, finding that JAK-STAT family members are regulated by KMT2A, an epigenetic regulator. These analyses reveal the trans-regulatory cascades linking GWAS genes to signaling pathways.

Keywords: CRISPR; GWAS; RNA-seq; T cells; gene regulatory networks; immunology; inbornb errors of immunity; network inference.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests A.M. is a cofounder of Site Tx, Arsenal Biosciences, Spotlight Therapeutics, and Survey Genomics; serves on the boards of directors at Site Tx, Spotlight Therapeutics, and Survey Genomics; is a member of the scientific advisory boards of Site Tx, Arsenal Biosciences, Cellanome, Spotlight Therapeutics, Survey Genomics, NewLimit, Amgen, and Tenaya; owns stock in Arsenal Biosciences, Site Tx, Cellanome, Spotlight Therapeutics, NewLimit, Survey Genomics, Tenaya, and Lightcast; and has received fees from Site Tx, Arsenal Biosciences, Cellanome, Spotlight Therapeutics, NewLimit, Gilead, Pfizer, 23andMe, PACT Pharma, Juno Therapeutics, Tenaya, Lightcast, Trizell, Vertex, Merck, Amgen, Genentech, GLG, ClearView Healthcare, AlphaSights, Rupert Case Management, Bernstein, and ALDA. A.M. is an investor in and informal advisor to Offline Ventures and a client of EPIQ. The Marson laboratory has received research support from the Parker Institute for Cancer Immunotherapy, the Emerson Collective, Arc Institute, Juno Therapeutics, Epinomics, Sanofi, GlaxoSmithKline, Gilead, and Anthem and reagents from Genscript and Illumina. J.W.F. was a consultant for NewLimit, is an employee of Genentech, and has equity in Roche. A.B. is a stockholder in Alphabet, Inc., and a consultant for Third Rock Ventures. J.S.W. was a consultant to Spiral Genetics.

Figures

None
Graphical abstract
Figure 1
Figure 1
Study overview Schematic describing the three gene sets that were perturbed with CRISPR knockouts and modeling of the gene network, network inference analyses, and gene module identification and integration with immune GWAS data.
Figure 2
Figure 2
The gene network of the 84 perturbed genes (A) Estimate of the directed network that describes how the 84 perturbed genes interact. The radius of each point is proportional to the degree of that gene. Arrows are used to indicate directionality of the edges, such that an arrow pointing into a gene indicates that it is being regulated by another gene. For emphasis, the opacity of the edges from or to inborn error of immunity transcription factors is increased, and all other edges are displayed with greater transparency. Positive values in the color scale indicate that the parent gene is a positive regulator of the child gene. (B) A sub-network centered around STAT1. (C) A scatterplot of the indegree and outdegree of each of the 84 genes. (D) Association analyses between gene properties and their indegree, outdegree, and total degree.
Figure 3
Figure 3
The landscape of downstream effects (A) The statistical model used to relate the 84 perturbed genes to the expressed genes. (B) The distribution of the number of downstream effects for each of the 84 genes, stratified by gene group. Genes that are outliers with respect to their gene group distribution are labeled. (C) The distribution of indegree for each of the non-perturbed genes. Outlier genes are labeled. (D) Association between the properties of downstream genes and the gene set of the upstream regulators. Coefficients are estimated with negative binomial regressions of the gene-set-specific indegree. Downstream gene annotations are indicated on the y axis, and the facets are used to indicate the gene set of the upstream regulator. ∗The 306 denotes the total number of RNA-seq observations, which includes 84 genes perturbed in three donors and 54 samples from control guides.
Figure 4
Figure 4
The discovery of gene modules Hierarchical clustering is used to identify clusters of shared downstream effects. The upstream gene members within each module are labeled in the left-handed margin of the plot, and the gene group of each gene is indicated by the text color. The total number of genes in the module, including both upstream and downstream effects, is included under the list of genes.
Figure 5
Figure 5
Gene module characterization (A) Enrichment analyses of KEGG genetic, immune, and signaling pathways for each of the 84 perturbed genes, stratified by gene module. The JAK-STAT pathway is highlighted with a dashed red box. The color bar maximum is set to 4. (B) The JAK-STAT sub-network, which is organized such that cytokine genes are at the bottom and upstream regulators are at the top. (C) Effects of KOs in the gene modules on a proliferation assay. Each point represents an individual gene perturbation sample plotted as the log2 fold change sample count as compared to AAVS1 KO control samples from the same donor (∗p < 0.05 and ∗∗∗∗p < 0.001; n = 3 donors per KO, the number of KOs per cluster is reflected in Figure 4).
Figure 6
Figure 6
Autoimmune and allergy association of SNPs linked with the gene modules (A) Estimated τ coefficients from LD score regression are plotted for each gene module and phenotype. Module 0 is defined as genes that were not included in any module but are still expressed in CD4+ T cells. (B) Exemplar analysis annotating the fine-mapped genes from a Finngen dermatitis GWAS based on their presence in module 2A.
Figure 7
Figure 7
The transcriptional logic linking module 4 to GWAS loci (A) The sub-network of module 4 and Th17 cytokines. (B) Locus plot including tracks describing the functional characteristics of the region. Each track is constructed from publicly available ChIP-seq data (STAR Methods) or ATAC-seq data from Freimer et al. Gray boxes indicate significantly different regions between the respective KO and AAVS1 control KO ATAC data (adjusted p value [padj] < 0.05, n = 3 donors per KO). The y axis displays normalized counts.
None

Update of

References

    1. Maurano M.T., Humbert R., Rynes E., Thurman R.E., Haugen E., Wang H., Reynolds A.P., Sandstrom R., Qu H., Brody J., et al. Systematic Localization of Common Disease-Associated Variation in Regulatory DNA. Science. 2012;337:1190–1195. doi: 10.1126/science.1222794. - DOI - PMC - PubMed
    1. Finucane H.K., Bulik-Sullivan B., Gusev A., Trynka G., Reshef Y., Loh P.-R., Anttila V., Xu H., Zang C., Farh K., et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 2015;47:1228–1235. doi: 10.1038/ng.3404. - DOI - PMC - PubMed
    1. GTEx Consortium. Laboratory Data Analysis &Coordinating Center LDACC—Analysis Working Group. Statistical Methods groups—Analysis Working Group. Enhancing GTEx eGTEx groups. NIH Common Fund. Biospecimen Collection Source Site—NDRI. Jo B., Mohammadi P., Park Y., Parsana P., et al. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–213. doi: 10.1038/nature24277. - DOI - PMC - PubMed
    1. GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. - DOI - PMC - PubMed
    1. Võsa U., Claringbould A., Westra H.-J., Bonder M.J., Deelen P., Zeng B., Kirsten H., Saha A., Kreuzhuber R., Yazar S., et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat. Genet. 2021;53:1300–1310. doi: 10.1038/s41588-021-00913-z. - DOI - PMC - PubMed

MeSH terms

Substances