Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Aug;181 Suppl 76(Suppl 76):118-144.
doi: 10.1002/ajpa.24713. Epub 2023 Feb 16.

Genomic structural variation: A complex but important driver of human evolution

Affiliations
Review

Genomic structural variation: A complex but important driver of human evolution

Daniela C Soto et al. Am J Biol Anthropol. 2023 Aug.

Abstract

Structural variants (SVs)-including duplications, deletions, and inversions of DNA-can have significant genomic and functional impacts but are technically difficult to identify and assay compared with single-nucleotide variants. With the aid of new genomic technologies, it has become clear that SVs account for significant differences across and within species. This phenomenon is particularly well-documented for humans and other primates due to the wealth of sequence data available. In great apes, SVs affect a larger number of nucleotides than single-nucleotide variants, with many identified SVs exhibiting population and species specificity. In this review, we highlight the importance of SVs in human evolution by (1) how they have shaped great ape genomes resulting in sensitized regions associated with traits and diseases, (2) their impact on gene functions and regulation, which subsequently has played a role in natural selection, and (3) the role of gene duplications in human brain evolution. We further discuss how to incorporate SVs in research, including the strengths and limitations of various genomic approaches. Finally, we propose future considerations in integrating existing data and biospecimens with the ever-expanding SV compendium propelled by biotechnology advancements.

Keywords: brain; gene duplications; genomes; human evolution; structural variation.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.. Examples of genomic structural variation.
SVs exist as deletions and duplications (with the largest, most similar duplications termed segmental duplications, or SDs) that change the copy of a genomic segment (i.e., CNVs). Other types of SVs include insertions, translocations, inversions, as well as more complex events not pictured. Figure is adapted from (Alkan, Coe, et al., 2011) via “Genome Structural Variations” by BioRender.com (2022). Retrieved from https://app.biorender.com/biorender-templates.
Figure 2.
Figure 2.. Cladogram of Hominidae family.
Divergence time estimates were obtained from Sudmant et al. (2013).
Figure 3.
Figure 3.. Difficulties appraising haplotypes between SVs and neighboring SNVs.
(A) Neighboring SNVs (orange circles) are difficult to detect when an SV (colored rectangle) is embedded in repeat-rich regions. (B, C) Haplotypes can be disrupted by (B) IGC and (C) recurrent deletions (H) and duplication (H’). (D) mCNVs can be in the same locus (H) or several kilobases apart (H’).
Figure 4.
Figure 4.. Summary of human duplicated genes implicated in neurodevelopmental functions.
Partially duplicated SRGAP2C antagonizes ancestral SRGAP2A through dimerization of their F-BAR domains, causing its degradation and resulting in increases in the rate of neuronal migration, density of dendritic spines, long-range neuronal connections, and synapse density, as well as delayed neuronal maturation. Truncated ARHGAP11B carries 55 distinct terminal amino acids that result in a loss of its ancestral RhoGAP activity, increasing calcium levels in the mitochondria that result in increased glutaminolysis and a higher abundance of basal progenitors that lead to presence of gyrification. HSE of TBC1D3 located in a core duplicon has been expanded multiple times. Studies have revealed an increase in cortical progenitors and subsequently neurons in the presence of this gene, ultimately resulting in mice with gyrencephalic brains. Incomplete duplication of the N-terminal portion of NOTCH2 and subsequent expansion gave rise to several NOTCH2NL paralogs that remained functional likely due to IGC events and that have been found to directly increase the abundance of cortical progenitors and neurons. Figure created with BioRender.com.
Figure 5.
Figure 5.. Transcriptional profiles of a subset HSD genes across corticogenesis using data from differentiated hESCs.
RNA-seq quantification across a time-course of 77 days to mimic developmental stages including: pluripotency (plur.), differentiation (diff.), cortical specification (cort. spec.), deep layer formation (form.) and upper layer formation (upp. lay. form.). Expression levels displayed as z-scores of HSD genes in relation to the complete transcriptome indicated as colors (red = high; blue = low expression).
Figure 6.
Figure 6.. Short- and long-read SV discovery signals.
R: Reference. S: Sample. SRS: short-read sequencing. LRS: long-read sequencing. Dashed line connects two pairs from the same short-read sequencing DNA fragment. Pink shapes represent long reads.
Figure 7.
Figure 7.. Genomic artifacts arising from errors in the human reference genome assembly.
False positive heterozygous SNV calls originating from missing copies in the reference due to identification of paralog-sequence variants due to reads mapping from multiple paralogs.
Figure 8.
Figure 8.. Differences in mappability between short and long reads in duplicated genes.
Paralog-specific variants (PSVs) (vertical lines) can distinguish paralogs enabling detection of polymorphic variation (yellow dots). Reads that do not carry PSVs (dashed lines) are unmappable in duplicated regions. SRS: short-read sequencing. LRS: long-read sequencing.

Similar articles

Cited by

References

    1. 1000 Genomes Project Consortium, Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, … McVean GA (2010). A map of human genome variation from population-scale sequencing. Nature, 467(7319), 1061–1073. - PMC - PubMed
    1. 1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, … Abecasis GR (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74. - PMC - PubMed
    1. Abel HJ, Larson DE, Regier AA, Chiang C, Das I, Kanchi KL, … Hall IM (2020). Mapping and characterization of structural variation in 17,795 human genomes. Nature, 583(7814), 83–89. - PMC - PubMed
    1. Aganezov S, Yan SM, Soto DC, Kirsche M, Zarate S, Avdeyev P, … Schatz MC (2022). A complete reference genome improves analysis of human genetic variation. Science, 376(6588), eabl3533. - PMC - PubMed
    1. Aguirre M, Rivas MA, & Priest J (2019). Phenome-wide Burden of Copy-Number Variation in the UK Biobank. American Journal of Human Genetics, 105(2), 373–383. - PMC - PubMed

Publication types