Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May;629(8011):384-392.
doi: 10.1038/s41586-024-07292-5. Epub 2024 Apr 10.

Cell-type-resolved mosaicism reveals clonal dynamics of the human forebrain

Affiliations

Cell-type-resolved mosaicism reveals clonal dynamics of the human forebrain

Changuk Chung et al. Nature. 2024 May.

Abstract

Debate remains around the anatomical origins of specific brain cell subtypes and lineage relationships within the human forebrain1-7. Thus, direct observation in the mature human brain is critical for a complete understanding of its structural organization and cellular origins. Here we utilize brain mosaic variation within specific cell types as distinct indicators for clonal dynamics, denoted as cell-type-specific mosaic variant barcode analysis. From four hemispheres and two different human neurotypical donors, we identified 287 and 780 mosaic variants, respectively, that were used to deconvolve clonal dynamics. Clonal spread and allele fractions within the brain reveal that local hippocampal excitatory neurons are more lineage-restricted than resident neocortical excitatory neurons or resident basal ganglia GABAergic inhibitory neurons. Furthermore, simultaneous genome transcriptome analysis at both a cell-type-specific and a single-cell level suggests a dorsal neocortical origin for a subgroup of DLX1+ inhibitory neurons that disperse radially from an origin shared with excitatory neurons. Finally, the distribution of mosaic variants across 17 locations within one parietal lobe reveals that restriction of clonal spread in the anterior-posterior axis precedes restriction in the dorsal-ventral axis for both excitatory and inhibitory neurons. Thus, cell-type-resolved somatic mosaicism can uncover lineage relationships governing the development of the human forebrain.

PubMed Disclaimer

Conflict of interest statement

Competing interests

K.K. is a senior scientist at Bioskryb Genomics Inc. All other authors declare no competing interests.

Figures

Extended Data Fig. 1.
Extended Data Fig. 1.. Tissues collected from ID01 and ID05.
Red dots indicate approximate sites of punch biopsies. Abbreviations: PF, prefrontal cortex; F, frontal cortex; P, parietal cortex; O, occipital cortex; T, temporal cortex; I, insular cortex; Cb, Cerebellum; CC, cingulate cortex; mOC, medial occipital cortex; Cau, Caudate; Put, Putamen; Thal, Thalamus; GP, globus pallidus; EC, entorhinal cortex; HIP, hippocampus; AMG, amygdala; POA, preoptic area; Cl, Claustrum.
Extended Data Fig. 2.
Extended Data Fig. 2.. Bisulfite sequencing in sorted TBR1+ and DLX1+ nuclear pools correlate with excitatory and inhibitory neuron methylome signatures.
(a-c) MFNS gating strategy on 30,000 single brain nuclei using DLX1, TBR1, and COUPTFII antibodies. X-axis: 488 channel intensity for monitoring autofluorescence signals. Y axis: Fluorescence intensity from antigen-bound antibodies. (d) snRNA-seq of post-MFNS nuclei confirming enrichment of targeted nuclear types. (e) Marker expression in assigned nuclear types correlating with targeted nuclear types. (f) UMAP plot NR2F2 expression pattern (encoding COUPTFII) highlighting a subpopulation of inhibitory neurons (compare with Fig. 1c). (g-i) Reference excitatory and inhibitory neuronal methylome signatures (aggregated from an available public single-nuclei methylome dataset) compared to methylome signatures of sorted nuclei and a bulk heart sample. Normalized relative methylation levels (y-axes) and genomic positions (x-axes) of genes listed at top. (g) Methylome signature of SLC17A7 encoding VGLUT1, an excitatory neuronal marker in the brain, showing reduced methylation (i.e. representing activation) across the gene body and especially near the transcription start site (TSS, red box) in TBR1+ excitatory neuron samples. (h) Methylome signature of SLC6A1 encoding VGAT1, an inhibitory neuronal marker in the brain, showing reduced methylation across the gene body and especially near the TSS (red box). (i) Methylome signature of RBFOX3 encoding NEUN, a mature neuronal marker in the brain, showing reduced methylation at the TSS in neurons compared with bulk heart. Ref ExN, reference excitatory neurons; Ref InN, reference inhibitory neurons. (j) Heatmap and dendrograms based on cosine similarities of global methylation patterns between groups. Two different TBR1+ or DLX1+ samples were aggregated. The TBR1+ nuclear pool was clustered with Ref ExN while the DLX1+ clustered near the pool with Ref InN. The control heart bulk sample was distant from either group.
Extended Data Fig. 3.
Extended Data Fig. 3.. Quality controls of MPAS results.
(a-b) Violin plot distribution of log-transformed total read depths (y-axes) of individual variant positions in 321 or 147 samples from ID01 or ID05 (x-axes), respectively. (c-d) Correlation between sqrt-t AF of individual variants from WGS and MPAS. Blue horizontal dashed lines: Lower bound for binomial distribution detection threshold. r and p-values (two-tailed) from Pearson’s Product-Moment correlation. Identity lines (red).
Extended Data Fig. 4.
Extended Data Fig. 4.. Basic characteristics of positively validated MVs from the cMVBA pipeline.
(a) ID01. (b) ID05. CTX, cortex; BG, Basal ganglia; THAL, thalamus; HIP, hippocampus; AMG, amygdala; CB, cerebellum; SUB, subiculum; CLA, Claustrum; POA, preoptic area; OLF, olfactory bulb. (c) Mutational signature analysis using 368 brain-specific sSNVs from ID01 and ID05 using Mutalisk. Clonal sSNVs show clock-like signatures such as SBS 1 and 5, reflecting embryonic developmental origins. (d-e) AF distributions of organ-shared early embryonic MVs in ID01(d) and ID05 (e) reflect the asymmetric clonal division in early human embryos. Vertical dashed lines (red): expected peaks (AF=25%) from the first symmetric cell division, absent in observed distribution, suggesting asymmetric divisions.
Extended Data Fig. 5.
Extended Data Fig. 5.. UMAP relationships between samples from the brain based on AFs of validated MVs.
Clustering by the same hemisphere validates lateralization of brain-derived cell clones except for the independent origin of microglia (marked by PU.1, arrow). (a-c) UMAP clustering in ID01 samples labeled by (a) cell type, (b) gross region, or (c) subregion, respectively. Clustered samples tend to show similar AF patterns. (d-f) UMAP clustering in ID05 samples labeled by (d) cell type, (e) region, or (f) subregion, respectively. Although PU.1 cells were not sorted in ID05, other findings are similar between ID01 and ID05.
Extended Data Fig. 6.
Extended Data Fig. 6.. Evidence for HIP lineage restriction occurring prior to CTX or BG in ID01 sorted nuclear pools.
(a) Heatmap with 17 sorted nuclear samples based on sqrt-t AFs of 121 informative MVs from ID01, similar to Fig. 2c, showing greater HIP lineage separation compared with CTX or BG (purple compared with green or yellow). (b) Contour plot (at center) with 121 informative MVs derived from (a) and two kernel density estimation plots (at periphery). Axes show the absolute normalized difference value for each MV between the average AF of CTX and BG (CTX-BG) or CTX and HIP regions (CTX-HIP). Solid line: identity. Red dot: averaged x and y values of individual data points. sqrt-t AF, square-root transformed allele fraction; CTX, cortex; BG, basal ganglia; HIP, hippocampus; Cau, caudate; DG, dentate gyrus; HIP and Hip, hippocampal tissue; I, insular cortex; O, occipital cortex; P, parietal cortex; PF, prefrontal cortex; Put, putamen; T, temporal cortex; GP, globus pallidus.
Extended Data Fig. 7.
Extended Data Fig. 7.. Confidence for dendrograms with sorted nuclei from cortical areas.
(a) Bootstrapping results of ID01. (b) Bootstrapping results of ID05. The percentage of 10,000 replicates showing relationships between sqrt-t AFs for TBR1+ and DLX1+ nuclei in the same geographic region were more similar than TBR1+ nuclei from two different geographic regions (arrow for example). COUPTFII+ nuclei clustered among themselves, outside of the DLX1 and TBR1 clusters. Approximated unbiased p-value > 95% (red): the hypothesis “the cluster does not exist” rejected with a significance level (< 5%). (c-d) Heatmaps and hierarchical clustering results after computational deconvolution of DLX1+ nuclei (grey) from Fig. 3b (c) and Fig. 3c (d). (e-f) Heatmaps and hierarchical clustering results after the simulated TBR1+ nuclei contamination for COUPTFII+ nuclear pools (black) from Fig. 3b (e) and Fig. 3c (f). (g) The estimated proportion of dorsally derived cortical inhibitory neurons within deconvolved DLX1+ nuclei of each lobe. The least square method is used (Methods). 11, 13 cortical lobes for ID01 and ID05, respectively. Median, thick horizontal line at the center; 95% confidence intervals, the notch of the box plot; 75 and 25% quantiles of data, upper and lower bounds of the box; Whiskers, maxima and minima excluding outliers.
Extended Data Fig. 8.
Extended Data Fig. 8.. Quality controls of the ResolveOME dataset in ID05.
(a) A UMAP plot of snRNA-seq using 225 NEUN + nuclei and 121 aggregated reference cell types. F, frontal; T, temporal; HIP, hippocampus; REF, reference dataset. (b) UMAP labeled by cell types. Note that UMAP clusters separate by cell type (ExN, InN or Other) more than by location. (c) Relative expression of cell type markers within clusters, confirming cell identity. (d) Hierarchical clustering based on sqrt-t AFs of 34 informative MVs shared in 5 to 29 cells in single-nuclear data. F- NEUN, sorted frontal NEUN+ nuclei pool; F-sc, pseudo-bulk snMPAS data from a frontal lobe punch; T-sc, snMPAS data from a frontal (F) lobe punch. (e) Correlation between sqrt-t AFs of MVs between F- NEUN and F-sc. (f) Correlation between sqrt-t AFs of MVs between F- NEUN and T-sc. In e and f, linear regression with upper and lower 95% prediction intervals displayed by blue solid lines and gray surrounding area; sqrt-t (AF), sqrt-t AF. Pearson’s Product-Moment correlation with r and p-values (two-tailed) in e and f. (g) Null distribution of the frequency of the number of inhibitory neurons carrying MVs exclusively detected in one lobe and shared with at least two other local cells, including one excitatory neuron within the same lobe. 10,000 permutations. The portion to the right of the red dashed line, compared to the entire distribution, represents the probability (p < 0.0001, one-tailed permutation test) of having 15 or more InNs. (h-m) RNA expression levels of informative genes between InN1 (n = 17) and InN2 (n = 16) (Fig 4b) in snRNA-seq. (h) Comparable expression levels of inhibitory neuronal markers between both groups. (i) Decreased tendency for the expression of CGE-derived cell markers in InN2 compared to InN1, implying COUPTFII+ inhibitory neurons are unlikely InN1, consistent with previous observations in sorted nuclear populations. (j) RELN+ inhibitory neuronal marker showed decreased expression tendency in InN2 compared to InN1. (k) Increased expression tendency for parvalbumin-positive (PV+) inhibitory neuronal marker in InN2 compared to InN1, implying dorsally derived inhibitory neurons include PV+ neurons. (l, m) top 3 genes increased (l) or decreased (m) in InN2 compared to InN1 among the most variable 3000 protein-coding genes.
Extended Data Fig. 9.
Extended Data Fig. 9.. Phylogenic tree analysis.
(a) Phylogenic tree generated after 1000 bootstrap replications based on the 68 MVs in 118 single nuclei in Fig. 4b. Bootstrap values supporting each edge are labeled beside branches of the tree. (b) The number of pairs diverging from the latest branch that has the local highest-confident edge is shown based on the lobe and cell type. For example, the number of excitatory-excitatory neuron pairs within the same lobe clustered with the local highest-confident edge was 20.
Extended Data Fig. 10.
Extended Data Fig. 10.. UMAP plots with sorted nuclear pools based on sqrt-t AFs of 186 informative MVs from Fig. 5.
Colors of data points correspond to the spatial information in the grey box.
Figure 1.
Figure 1.. Comprehensive cMVBA identifies cell-type-resolved and region-specific MVs.
(a) cMVBA workflow overview consists of three phases: 1. Tissue collection: Cadaveric organs accessed for tissue punches from organs listed (red circles and black arrows: punch locations in organs and frontal lobe) for MV detection. 2. A subset of the bulk tissue punches undergo 300x WGS followed by best-practice MV calling pipelines to generate a list of MV candidates. 3. DNA from each punch bulk tissue, methanol fixed nuclear sorted (MFNS) samples, or individual nuclei are subjected to validation and quantification of MV candidates via massive parallel amplicon sequencing (MPAS). AFs of the validated MVs from MPAS are used to determine clonal dynamics of different neuronal cell types and reconstruct features of brain development. (b) UMAP plot from snRNA-seq with MFNS sorted or unsorted cortical nuclear pools (n = 3322). Sorted nuclear groups labeled with distinct colors. (c) Cortical cell types based on marker expression and differentially expressed genes in each cluster. (d) Cell type proportion within each sorted cortical nuclear population. (e-f) Number of MVs categorized by location detected in each donor ID01 or ID05 (Supplementary Data 4). ‘Brain-only’ MVs (i.e., detected only in brain tissue, but not other organs) including subtypes in red. ‘C+D only’: Brain-only MVs exclusively detected in both COUPTFII+ and DLX1+ populations but not the other cell types. Ast, Astrocyte; ExN, Excitatory neurons; InN, inhibitory neurons; OL, oligodendrocytes; OPC, oligodendrocyte precursor cells; U, undefined.
Figure 2.
Figure 2.. Human hippocampal lineage diverges from the cortex and basal ganglia.
(a) Model of clonal dynamics in forebrain anlage. Cells restricted to anlage A, which acquire a new MV (purple) are rarely present in anlage B. Later, B diverges into B’ and B”, which share more clones (yellow) than with anlage A. This analysis was applied to the geographies of the basal ganglia (BG), cortex (CTX) and hippocampus (HIP). (b-c) Heatmaps with 30 bulk samples from left hemispheric CTX, BG, and HIP (y-axis, b) or 12 selected sorted cell types (y-axis, c), compared with MVs identified in at least two samples (146 MVs, x-axis in (b) or 131 MVs, x-axis in (c)), depicted in Fig. 2d. Dendrograms at right shows greater HIP lineage separation (purple, arrow) compared with CTX or BG (green and yellow) either using bulk tissue (b) sorted nuclei (c), suggesting HIP earlier lineage restriction. (d) Counts of shared MVs across CTX, BG or HIP within sorted nuclear pools showing many more shared MVs between CTX and BG compared with HIP in both donors (permutation P<0.001). (e-f) Contour plots of informative 113 and 131 MVs from (b) and (c) (blue) and two kernel density estimation plots (grey). Axes show absolute normalized AF difference for each MV averaged across all samples from the respective tissues (CTX, HIP, BG). Black line: identity line, black dots: individual MVs, large red dot: averaged across all MVs, suggesting AF differences are smaller between CTX and BG than HIP. Abbreviations: Cau, caudate; DG, dentate gyrus; HIP and Hip, hippocampal tissue, where Hip refers to hippocampal subregion not specified; I, insular cortex; O, occipital cortex; P, parietal cortex; PF, prefrontal cortex; Put, putamen; T, temporal cortex; mO, medial occipital cortex; GP, globus pallidus; sqrt-t (AF), square-root transformed allele fraction.
Figure 3.
Figure 3.. Clonal dynamics of cortical excitatory and inhibitory neurons.
(a) cMVBA workflow uses MFNS nuclei for MPAS assessment of AFs in cortical punches. (b-c) Heatmaps of sorted nuclei based on AFs of 146 informative shared MVs from ID01 (b) and 186 from ID05 (c) (y-axis) compared with color-coded hemisphere, cell type or region. PF, prefrontal cortex; F, frontal cortex; P, parietal cortex; T, temporal cortex; O, occipital cortex; mO, medial occipital cortex; I, insular cortex; CC, cingulate cortex; EC, Entorhinal cortex. sqrt-t (AF), square-root transformed allelic fraction. Dendrograms indicate that subcortically derived COUTFII samples cluster together (teal), whereas DLX1 and TBR1 samples derived from the same region cluster together. (d-g) Lolliplots comparing regions (x-axis) with sqrt-t AF (y-axis) for representative MVs. Height of individual lollipop: AF; color: cell type; dashed line: threshold. Next to each lolliplot is the ‘geoclone’ representation of sqrt-t AF shaded intensity (pink) from tissue where detected. Gray boxes: not sampled. (h-i) Standard deviation (SD) of sqrt-t AFs for 146 and 186 MVs in the three different cell types in donor ID01 (h) and ID05 (i), respectively. Each dot: single MV measured in 34 vs. 45 punches across the neocortex in ID01 and ID05, respectively. One-way ANOVA with Tukey’s multiple comparison test with adjusted p-values. PF, prefrontal cortex; F, frontal cortex; P, parietal cortex; O, occipital cortex; T, temporal cortex; I, insular cortex; CC, cingulate cortex; mOC, medial occipital cortex; L, left; R, right.
Figure 4.
Figure 4.. snMPAS incorporating snRNA-seq supports the existence of dorsally derived cortical inhibitory neurons in humans.
(a) Frontal and temporal small punches of the right hemisphere of ID05 NEUN+ 191 single-nuclei DNA were subjected to primary template-directed amplification (PTA) followed by single nuclear MPAS (snMPAS) genotyping with concurrent snRNA-seq, termed ResolveOME, for analysis. (b) A double-ranked plot divided in half based on brain region (frontal: left, temporal: right). 68 MVs identified in anywhere between 2 to 14 cells (No. of detection). A total of 118 (85 excitatory and 33 inhibitory) neurons plotted. MVs were further classified according to whether they were detected in both frontal and temporal lobes (F-T Shared) vs. a single lobe F-only (purple) or T-only (blue). Dark purple or blue: detected in 3 or more nuclei, light purple or blue: detected in only 2 nuclei. ExN & InN shared: MVs shared between ExNs and InNs exclusively within F-only or T-only MVs. x-axis: cell type, i.e. ExN (green): excitatory neurons; InN1 (orange): All inhibitory neurons except for InN2. InN2 (red): Inhibitory neurons carrying at least one ExN & InN shared MV. For example, MV 13-69308268-A-G (arrow) was detected in 2 nuclei (*), an ExN and an InN. (c) Distribution MV 13-69308268-A-G across cortical areas in ID05. Left: lolliplot and Right: geoclone showing sqrt-t AFs of each sorted nuclear pool in different cortical locations, reproducing single-nucleus data. Dashed line: threshold. (d) Pseudo-bulk analysis after aggregation based on individual MVs (x-axis) and cell types and regions (y-axis). (e) Model for the shared origin of local ExN and InN cortical neurons. Dorsal clones (triangle and square) can produce both TBR1+ excitatory neurons and DLX1+ inhibitory neurons. Ventrally derived DLX1+ inhibitory neurons (circle) tangentially migrate and are more likely to disperse across the cortex.
Figure 5.
Figure 5.. Earlier establishment of the A-P axis compared to D-V restricted clonal spread (RCS) within a cortical lobe.
(a) Workflow for the observation of clonal dynamics in a lobe. A total of 17 punches were radially sampled and subjected to MFNS to assess MVs. The AF of MVs in different sites were mapped onto the geoclones (checkerboard). (b) Heatmap and dendrogram hierarchical cluster of sorted nuclear pools based on sqrt-t AFs of 186 informative MVs from ID05 detected in the right parietal lobe. Sidebars left of the heatmap: cell type and sample location information, according to the convention in Fig. 3. Colors for lobar location taken from (a). Dendrogram highlights two main clusters (C1: blue; C2: red) that when mapped back onto the sampled spatial coordinates (c) are separated in the AP dimension (red and blue circle). (d) Geoclones of three individual MVs from b (box, arrows), showing shades of pink are more different in the A-P axis than in the D-V axis. Gray box: not available. (e-f) Cell-type-resolved contour plots of 94 shared MVs from b (center) with colored kernel density estimation plots (in the periphery) for DLX1 (top) or TBR1 (bottom), showing MVs from both cell types with a greater normalized difference of sqrt-t AFs in A-P than in D-V. Grey: kernel density estimation plot of the orthogonal axis. Arrow head: local peak of the density estimation plot due to MVs with greater difference of AFs in A-P than D-V. Black line: identity line. Dots: individual MVs, large red dot: average across all MVs.

Update of

References

    1. Anderson SA, Eisenstat DD, Shi L & Rubenstein JL Interneuron migration from basal forebrain to neocortex: dependence on Dlx genes. Science 278, 474–476, doi:10.1126/science.278.5337.474 (1997). - DOI - PubMed
    1. Letinic K, Zoncu R & Rakic P Origin of GABAergic neurons in the human neocortex. Nature 417, 645–649, doi:10.1038/nature00779 (2002). - DOI - PubMed
    1. Wonders CP & Anderson SA The origin and specification of cortical interneurons. Nat Rev Neurosci 7, 687–696, doi:10.1038/nrn1954 (2006). - DOI - PubMed
    1. Petanjek Z, Berger B & Esclapez M Origins of cortical GABAergic neurons in the cynomolgus monkey. Cereb Cortex 19, 249–262, doi:10.1093/cercor/bhn078 (2009). - DOI - PMC - PubMed
    1. Hansen DV et al. Non-epithelial stem cells and cortical interneuron production in the human ganglionic eminences. Nat Neurosci 16, 1576–1587, doi:10.1038/nn.3541 (2013). - DOI - PMC - PubMed

Method-only references

    1. Huang AY et al. MosaicHunter: accurate detection of postzygotic single-nucleotide mosaicism through next-generation sequencing of unpaired, trio, and paired samples. Nucleic Acids Res 45, e76, doi:10.1093/nar/gkx024 (2017). - DOI - PMC - PubMed
    1. Breuss MW et al. Autism risk in offspring can be assessed through quantification of male sperm mosaicism. Nat Med 26, 143–150, doi:10.1038/s41591-019-0711-0 (2020). - DOI - PMC - PubMed
    1. Cibulskis K et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol 31, 213–219, doi:10.1038/nbt.2514 (2013). - DOI - PMC - PubMed
    1. Yang X et al. Control-independent mosaic single nucleotide variant detection with DeepMosaic. Nat Biotechnol 41, 870–877, doi:10.1038/s41587-022-01559-w (2023). - DOI - PMC - PubMed
    1. Dou Y et al. Accurate detection of mosaic variants in sequencing data without matched controls. Nat Biotechnol 38, 314–319, doi:10.1038/s41587-019-0368-8 (2020). - DOI - PMC - PubMed

Publication types

MeSH terms

Substances