Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 17;15(1):4220.
doi: 10.1038/s41467-024-48392-0.

Mapping recurrent mosaic copy number variation in human neurons

Collaborators, Affiliations

Mapping recurrent mosaic copy number variation in human neurons

Chen Sun et al. Nat Commun. .

Abstract

When somatic cells acquire complex karyotypes, they often are removed by the immune system. Mutant somatic cells that evade immune surveillance can lead to cancer. Neurons with complex karyotypes arise during neurotypical brain development, but neurons are almost never the origin of brain cancers. Instead, somatic mutations in neurons can bring about neurodevelopmental disorders, and contribute to the polygenic landscape of neuropsychiatric and neurodegenerative disease. A subset of human neurons harbors idiosyncratic copy number variants (CNVs, "CNV neurons"), but previous analyses of CNV neurons are limited by relatively small sample sizes. Here, we develop an allele-based validation approach, SCOVAL, to corroborate or reject read-depth based CNV calls in single human neurons. We apply this approach to 2,125 frontal cortical neurons from a neurotypical human brain. SCOVAL identifies 226 CNV neurons, which include a subclass of 65 CNV neurons with highly aberrant karyotypes containing whole or substantial losses on multiple chromosomes. Moreover, we find that CNV location appears to be nonrandom. Recurrent regions of neuronal genome rearrangement contain fewer, but longer, genes.

PubMed Disclaimer

Conflict of interest statement

J.V.M. is an inventor on patent US6150160, is a paid consultant for Gilead Sciences, serves on the scientific advisory board of Tessera Therapeutics Inc. (where he is paid as a consultant, and has equity options), has licensed reagents to Merck Pharmaceutical, and recently served on the American Society of Human Genetics Board of Directors. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. SCOVAL: identification of copy number variation using read-depth and allele imbalance.
Overview of SCOVAL. A Single nuclei and bulk dural fibroblast DNA were analyzed using 10X platforms. (Images from vecteezy.com) B Single nuclei library quality is assessed based on median absolute deviation (MAD) and copy number thresholds are established using population statistics. Graphs depict schematized data; vertical red lines illustrate threshold strategy. C Candidate CNVs are identified based on altered read depth across consecutive genomic bins. D Heterozygous SNPs are phased using bulk linked-reads in chromosomal segments (“hap 1” or “hap 2”). E Absolute log2 ratios derived from “hap 1”/“hap 2” are calculated across ~100 SNP windows (see text). A deletion with concordant loss of heterozygosity (log2 ratio <> 0) is illustrated. F A highly aberrant CNV neuron (#5) shows representative Gingko calls (blue bars), duplications (e.g., green arrow), heterozygous deletions (e.g., black arrow), and homozygous deletions (e.g., orange arrow) and qualitatively concordant increases in absolute log2 ratio (white<purple). The genome is plotted from left to right on the x-axis, read-depth is in the upper panel (CN state on the Y-axis), and absolute log2 ratios are reported in the lower panel.
Fig. 2
Fig. 2. CNV neurons can have highly aberrant karyotypes.
A The observed CNV per neuron [(purple bars, counts (y-axis),CNVs/neuron (x-axis)] distribution deviates (P < 0.0001) from Poisson expectations (dashed blue line). Arrows indicate neurons with monosomic chromosomes. B, C Deletions cluster in a subset of CNV neurons. B Counts (y-axis) of the cumulative percent of each chromosome deleted (n = 2097 neurons * 22 autosomes) in CNV neurons. C Neuronal genomes (n = 2097) are arranged in a cells-by-chromosome matrix, ranked by the total percentage of their genome containing deletions. Cell #226 is the first CNV neuron among 2097 total neurons with the smallest observed single deletion (blue = unaffected chromosome, yellow <50%, orange = 50−99%, red 100%). DF Among 65 neurons with the most aberrant genomes, some have similar karyotypes. D Hierarchical clustering identifies two groups (yellow, red) with the least divergence from similarity (y-axis). E Red cluster neurons [cells #32, 33, and 47 in (C)] have similar CNV profiles. Read-depth is plotted as in Fig. 1F. The yellow cluster (cells #17 and #19) is shown in Fig. S6B. F Concordant read-depth is observed on opposite haplotypes in the most similar pair [#32(red) and #33(blue)]. When overlapping, events on cell #47 (green) match the #32 haplotype, but never the #33 haplotype. Chromosome 3 is plotted from left to right. Haplotype log2 ratio (upper panel) and corresponding read-depth (lower panel, blue = diploid) plots show overlapping deletions and LOH for each haplotype.
Fig. 3
Fig. 3. Analysis of CNV distribution relative to random null model.
A Empirical read-depth plots of two CNV neurons (left panels) and representative permutations (right two panels) are displayed as in Fig. 1F. B Relative to 10,000 permutations of real data (represented by blue dotted line and error bars), high and low CNV burden are enriched at the extremities of the Gaussian distribution (green bars). C Circos plot shows that hotspots (red, outer tier) and cold spots (blue, outer tier) cluster on distinct chromosomes. Thirty-three pathogenic CNVs (blue, purple, inner tier) never overlap hotspots. Eleven (blue) overlap cold spots. D Violin plot (mean +/- SD) showing gene enrichment in cold spots (left, N = 56, mean gene hits = 57.71 ± 58.93) and depletion in hotspots (right, N = 83, mean gene hits = 15.65 ± 32.40) relative to other 5 Mb regions (N = 404 controls, mean gene hits 37.79 ± 43.71) with chi-square P < 0.001 for cold spots vs. controls and hotspots vs. controls.
Fig. 4
Fig. 4. Recurrent CNV breakpoints across multiple neurons.
A UCSC Genome Browser view of all CNVs detected on Chromosome 1 (47 neurons, rows). Seven neurons (red) contain CNVs that share a breakpoint region (CNVB). B Representative CNVB (red) on Chromosome 1 overlaps (±250 kb) two genes (lower panel). C Number of breakpoints identified in each Ginkgo bin (y-axis) relative to bin size (x-axis), shown for bins containing two or more CNVs (red) and averaged across all permutations in control set (blue line). DF Violin plots show real and permuted datasets, normalized by bin size, when examined for D number of breakpoints, E number of long (>100k) genes (****p < 0.0001 for one-tailed t-test), and F transcripts per million bp (TPM) values of the longest gene in each bin.

References

    1. Hozumi N, Tonegawa S. Evidence for somatic rearrangement of immunoglobulin genes coding for variable and constant regions. Proc. Natl Acad. Sci. USA. 1976;73:3628–3632. doi: 10.1073/pnas.73.10.3628. - DOI - PMC - PubMed
    1. Schrock E, et al. Multicolor spectral karyotyping of human chromosomes. Science. 1996;273:494–497. doi: 10.1126/science.273.5274.494. - DOI - PubMed
    1. Speicher MR, Gwyn Ballard S, Ward DC. Karyotyping human chromosomes by combinatorial multi-fluor FISH. Nat. Genet. 1996;12:368–375. doi: 10.1038/ng0496-368. - DOI - PubMed
    1. Padilla-Nash HM, et al. Jumping translocations are common in solid tumor cell lines and result in recurrent fusions of whole chromosome arms. Genes Chromosomes Cancer. 2001;30:349–363. doi: 10.1002/gcc.1101. - DOI - PubMed
    1. Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144:646–674. doi: 10.1016/j.cell.2011.02.013. - DOI - PubMed