Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 15;133(10):e166919.
doi: 10.1172/JCI166919.

Genetic predisposition to neuroblastoma results from a regulatory polymorphism that promotes the adrenergic cell state

Affiliations

Genetic predisposition to neuroblastoma results from a regulatory polymorphism that promotes the adrenergic cell state

Nina Weichert-Leahey et al. J Clin Invest. .

Abstract

Childhood neuroblastomas exhibit plasticity between an undifferentiated neural crest-like mesenchymal cell state and a more differentiated sympathetic adrenergic cell state. These cell states are governed by autoregulatory transcriptional loops called core regulatory circuitries (CRCs), which drive the early development of sympathetic neuronal progenitors from migratory neural crest cells during embryogenesis. The adrenergic cell identity of neuroblastoma requires LMO1 as a transcriptional cofactor. Both LMO1 expression levels and the risk of developing neuroblastoma in children are associated with a single nucleotide polymorphism, G/T, that affects a GATA motif in the first intron of LMO1. Here, we showed that WT zebrafish with the GATA genotype developed adrenergic neuroblastoma, while knock-in of the protective TATA allele at this locus reduced the penetrance of MYCN-driven tumors, which were restricted to the mesenchymal cell state. Whole genome sequencing of childhood neuroblastomas demonstrated that TATA/TATA tumors also exhibited a mesenchymal cell state and were low risk at diagnosis. Thus, conversion of the regulatory GATA to a TATA allele in the first intron of LMO1 reduced the neuroblastoma-initiation rate by preventing formation of the adrenergic cell state. This mechanism was conserved over 400 million years of evolution, separating zebrafish and humans.

Keywords: Genetic variation; Genetics; Molecular genetics; Oncology.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest: RAY is a founder and shareholder of Syros Pharmaceuticals, which is discovering and developing therapeutics directed at transcriptional pathways in cancer. BJA is a shareholder of Syros Pharmaceuticals. ATL is a founder and shareholder of Light Horse Therapeutics, which is discovering and developing small molecules to disrupt oncogenic protein complexes. JMM is a founder of Tantigen BIO and HULA Therapeutics, 2 companies focused on discovery and development of immunotherapies for childhood cancers. JKJ has, or had during the course of this research, financial interests in several companies developing gene editing technology, including Beam Therapeutics, Blink Therapeutics, Chroma Medicine, Editas Medicine, EpiLogic Therapeutics, Excelsior Genomics, Hera Biolabs, Horizon Discovery, Monitor Biotechnologies, Nvelop Therapeutics (f/k/a/ ETx Inc.), Pairwise Plants, Poseida Therapeutics, SeQure Dx Inc., Transposagen Biopharmaceuticals, and Verve Therapeutics. JKJ’s interests were reviewed and are managed by Massachusetts General Hospital and Mass General Brigham in accordance with their conflict-of-interest policies. JKJ is a coinventor on various patents and patent applications (CA2841710C, CA2907198C, US20140201858A1, WO2018071892A1, US20120178647A1, CA3059956A1, CA3063733A1, CA3031414A1, CA2996888A1, and CA2978314A1) that describe gene editing and epigenetic editing technologies. No other potential conflicts of interest are declared.

Figures

Figure 1
Figure 1. Evolutionary history of the G/T polymorphism at rs2168101.
(A) Phylogenetic tree representing the evolutionary relationship between the LMO1 genes from the indicated species over the last 400 million years. Blue font denotes species that exclusively harbor a G at rs2168101. Orange font denotes that only humans demonstrate a G/T polymorphism at the rs2168101 locus. (B) Distribution of the G and T alleles of rs2168101 across Human Genome Diversity Project (HGDP) populations, as illustrated by their genome browser (http://popgen.uchicago.edu/ggv/). Circles create a pie chart in which blue represents the proportion of human populations from different parts of the world with a G at rs2168101 (human chromosome 11, position 8255408), and orange represents the proportion with a T. (C) Shown is a modified UCSC Genome Browser (https://genome.ucsc.edu/) window of the human LMO1 locus indicating the 2 alternative transcription start sites and the position of rs2168101 in the first intron (top), a vertebrate conservation track graphing PhyloP conservation scores (middle), and Multiz alignments of multiple vertebrate species (bottom), illustrating a high level of conservation in the noncoding region surrounding rs2168101. (D) The immediate sequence neighborhood surrounding rs2168101 in the first intron of LMO1 from multiple species is shown. The G at rs2168101 in the human consensus sequence is marked with a red box.
Figure 2
Figure 2. The development of MYCN-driven neuroblastomas is impaired in the TATA/TATA and lmo1-null backgrounds.
(A) Diagram illustrating the construction of the lmo1 GATA/TATA (GATA/TATA) zebrafish line in which TALEN-mediated gene editing was used to replace the G at rs2168101 with a T. The rs2168101 G resides within the first intron of the zebrafish lmo1 gene (exons 1 and 2 are denoted by blue boxes) and creates the first nucleotide of a GATA DNA-binding sequence (in bold). To facilitate the precise genome editing and knock-in of the T allele at this locus, we used TALEN gene-editing technology targeting the sequences flanking the G at rs2168101 (as indicated in red and green) together with a single-stranded DNA oligonucleotide containing a T instead of the G with short flanking homology arms of 20 nucleotides (TATA-ssOligo). To prevent TALEN binding to the 5′ arm and activity after successful knock-in of the TATA-ssOligo and to aid in the identification of embryos containing the modified sequence, the TATA-ssOligo was designed with 2 additional nucleotide changes (CC to replace TT in the 5′ homology arm, marked in bold) to create a new restriction site for TfiI (see also Supplemental Figure 3). (B) To analyze the effect of the rs2168101 G T substitution on MYCN-induced neuroblastoma, compound transgenic zebrafish lines were created by crossing the transgenic lines Tg(dβh:MYCN) and Tg(dβh:EGFP) with the GATA/TATA knock-in line, as illustrated. The dβh:EGFP and dβh:MYCN lines, in which the zebrafish dβh promoter was used to facilitate tissue-specific expression of EGFP and MYCN, were established previously (24). (C) Representative fluorescent images of adult zebrafish showing EGFP-expressing tumors arising in the indicated transgenic lines. (D and E) Starting at 5 weeks postfertilization (wpf), zebrafish with the indicated genotypes were monitored biweekly for the presence of tumors by EGFP fluorescence microscopy. The graph shows a Kaplan-Meier analysis of the cumulative frequency of neuroblastomas in the transgenic lines. Statistical analysis was performed using the logrank test.
Figure 3
Figure 3. Expression of the lmo family genes in zebrafish and human neuroblastoma.
(A) RNA-Seq analysis was performed to measure the relative mRNA expression of lmo family genes in neuroblastomas arising in zebrafish with the indicated genotypes (GATA/GATA, n = 4; lmo1–/–, n = 3; and TATA/TATA, n = 2). mRNA expression levels for the indicated lmo family genes are represented by FPKM log-scale values. Expression under 1 FPKM is considered as nonexpressed. Statistical analysis was performed using the 2-tailed, unpaired t test. *P < 0.005. (B) Relative LMO1-4 mRNA expression levels measured by RNA-Seq in 153 primary human neuroblastoma samples (from TARGET), ranked from highest (left) to lowest levels of LMO1 expression (in FPKM). Expression correlation analysis demonstrated weak inverse correlation between LMO1 and the other 3 LMO family members (R ≥ –0.3, P < 0.05).
Figure 4
Figure 4. Lmo1 coregulates transcription factors that comprise the adrenergic neuroblastoma CRC.
(A) Heatmap image based on RNA-Seq data analysis showing differentially expressed genes in MYCN-induced neuroblastoma tumors arising in lmo1 GATA/GATA (WT), lmo1–/– and lmo1 TATA/TATA backgrounds categorized into 6 groups, as indicated. Each row corresponds to a gene, and signal intensity is normalized across the row. Genes were rank ordered from highest (right side of the map) to lowest (left side of the map) based on fold change of gene expression in TATA/TATA or lmo1–/– compared to GATA/GATA. (B) Heatmap representing gene expression changes of the known adrenergic neuroblastoma CRC transcription factors isl1, gata3, ascl1, phox2b, and hand2 in MYCN-induced neuroblastoma tumors arising in the GATA/GATA, lmo1–/– and TATA/TATA backgrounds. (C) LMO1 mRNA expression (TPM+1) violin plots retrieved from the 21Q1 release of Depmap (depmap.org) (18), from 23 neuroblastoma cell lines. Cell lines were defined as adrenergic (ADRN) (n = 18) or mesenchymal (MES) (n = 5) subtypes based on general gene expression profiles. Red bars indicate mean, dotted line indicates a TPM+1 of 1. P = 0.0188 by student’s t test.
Figure 5
Figure 5. MYCN-driven neuroblastomas from the TATA/TATA line, but not the lmo1–/– line, express the mesenchymal CRC.
(A) Heatmap image based on RNA-Seq data analysis showing the indicated genes of the mesenchymal CRC, which are upregulated in the MYCN-induced neuroblastomas arising in the TATA/TATA background, but not in the lmo1–/– background, compared with the GATA/GATA zebrafish. (B) Relative mRNA expression levels of the mesenchymal neuroblastoma phenotype marker fn1a in neuroblastoma arising in zebrafish with the indicated genotypes (GATA/GATA, n = 4; lmo1–/–, n = 3; and TATA/TATA, n = 2). (C) Relative mRNA expression levels of NOTCH pathway genes, including jag1a, notch2, and hes2, in neuroblastomas arising in zebrafish from the indicated genotypes (GATA/GATA, n = 4; lmo1–/–, n = 3; and TATA/TATA, n = 2). (D) Heatmap of transcription factors exclusively upregulated in MYCN-driven neuroblastomas from the lmo1–/– line. TFs; transcription factors.
Figure 6
Figure 6. Human TATA/TATA neuroblastomas resemble zebrafish TATA/TATA tumors with low LMO1 expression and a mesenchymal phenotype.
(A) Relative LMO1 mRNA expression levels (in FPKM) in 124 human neuroblastoma samples with the indicated genotypes at rs2168101: TATA/TATA, GATA/TATA, or GATA/GATA. Each neuroblastoma sample is assigned to either low (pink), intermediate (light red) or high risk (purple). Statistical analysis was performed using the 2-tailed, Welch’s t test. *P < 0.005. (B and C) Adrenergic and mesenchymal gene set signature scores were generated using GSVA based on the previously published expression profiles by Von Groningen, et al. (8). Higher positive scores indicate upregulation of the corresponding signature, whereas, lower negative scores indicate downregulation. GATA/GATA and GATA/TATA tumors were combined into 1 group (blue) and compared with the TATA/TATA tumors (tan). (D and E) UMAP representing the whole transcriptome landscape of 124 human neuroblastoma samples (dots) in a 2-dimensional space combined with the adrenergic (D) and mesenchymal (E) gene signatures. Relative Z-score-transformed expression for each signature (according to the heat scale) is shown for each tumor (points) and overall density (contours). Clusters 1–4 represent tumors falling into similar density contours based on the adrenergic signature (D) or mesenchymal signature (E). (FH) Combination of UMAP dimensionality reduction analysis and expression levels of LMO1 (F), LMO2 (G), and LMO3 (H). Relative Z-score of log2-transformed expression for LMO1 (F), LMO2, (G) or LMO3 (H) (according to the heat scale) is shown for each tumor (points) and overall density (contours). (I) MYCN status for all 124 human neuroblastoma samples.

Update of

Comment in

References

    1. Saint-André V, et al. Models of human core transcriptional regulatory circuitries. Genome Res. 2016;26(3):385–396. doi: 10.1101/gr.197590.115. - DOI - PMC - PubMed
    1. Arendt D, et al. The origin and evolution of cell types. Nat Rev Genet. 2016;17(12):744–757. doi: 10.1038/nrg.2016.127. - DOI - PubMed
    1. Martik ML, Bronner ME. Regulatory logic underlying diversification of the neural crest. Trends Genet. 2017;33(10):715–727. doi: 10.1016/j.tig.2017.07.015. - DOI - PMC - PubMed
    1. Matthay KK, et al. Neuroblastoma. Nat Rev Dis Primer. 2016;2:16078. doi: 10.1038/nrdp.2016.78. - DOI - PubMed
    1. Zimmerman MW, et al. MYC drives a subset of high-risk pediatric neuroblastomas and is activated through mechanisms including enhancer hijacking and focal enhancer amplification. Cancer Discov. 2018;8(3):320–335. doi: 10.1158/2159-8290.CD-17-0993. - DOI - PMC - PubMed

Publication types

Substances