Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 Jan 1;10(1):166-188.
doi: 10.1093/gbe/evx240.

Human Accelerated Regions and Other Human-Specific Sequence Variations in the Context of Evolution and Their Relevance for Brain Development

Affiliations
Review

Human Accelerated Regions and Other Human-Specific Sequence Variations in the Context of Evolution and Their Relevance for Brain Development

Anastasia Levchenko et al. Genome Biol Evol. .

Abstract

The review discusses, in a format of a timeline, the studies of different types of genetic variants, present in Homo sapiens, but absent in all other primate, mammalian, or vertebrate species, tested so far. The main characteristic of these variants is that they are found in regions of high evolutionary conservation. These sequence variations include single nucleotide substitutions (called human accelerated regions), deletions, and segmental duplications. The rationale for finding such variations in the human genome is that they could be responsible for traits, specific to our species, of which the human brain is the most remarkable. As became obvious, the vast majority of human-specific single nucleotide substitutions are found in noncoding, likely regulatory regions. A number of genes, associated with these human-specific alleles, often through novel enhancer activity, were in fact shown to be implicated in human-specific development of certain brain areas, including the prefrontal cortex. Human-specific deletions may remove regulatory sequences, such as enhancers. Segmental duplications, because of their large size, create new coding sequences, like new functional paralogs. Further functional study of these variants will shed light on evolution of our species, as well as on the etiology of neurodevelopmental disorders.

Keywords: deletions; duplications; genes; neurodevelopmental disorders; psychiatry; substitutions.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
—Examples of sequence variants described in the present review. (a–c) Actual examples of a HAR, several duplications, and a deletion. Screen shots from the UCSC Genome Browser are used. The species, genome assembly, chromosomal coordinates, and size of the region in bp are shown. (a) HAR1 is depicted. The coordinates of the region in the human NCBI genome assembly 35 are as defined in Pollard et al. (2006a). The twelve human-specific substitutions are denoted with asterisks. Dots within the alignment denote bases identical in human and other species. (b) The paralog SRGAP2B and the partially duplicated and inverted pseudogene SRGAP2D are pointed by arrows. Neighboring genes FAM72 are depicted; this picture however contradicts results in Dennis et al. (2017): SRGAP2B duplicated within the same genomic fragment as FAM72B and SRGAP2D duplicated within the same genomic fragment as FAM72D. This contradiction illustrates the incompleteness of the current human genome assembly. The depicted gene FAM72C is actually located on chromosome 1p11.2. The gene NBPF15 with primate-specific DUF120 domain amplification, denoted with a star, is also shown in this genomic fragment. The multispecies alignment depicts factual absence of this gene in nonprimate animals. The picture is similar for the 100 vertebrates’ alignment. (c) hCONDEL.122 as defined in McLean et al. (2011). Build 2 Version 1 (Oct 2005) of the chimpanzee genome assembly, produced by the Chimpanzee Sequencing and Analysis Consortium, is shown. The corresponding human sequence contains a gap (the deletion). Other species are shown for comparison. Single lines signify gaps (sequence absent) and double lines signify more complex cases that are more difficult to resolve. In cases where multiple chains of the same species align over a region of the chimpanzee genome, the chains with single-lined gaps are often due to retrotransposed pseudogenes, whereas chains with double-lined gaps are more often due to paralogs and duplicated pseudogenes. (d and e) Generalized examples of functional variants. The examples do not represent particular regulatory sequences or coding genes, but illustrate principles upon which these sequences function, according to studies, described in the present review. The schematics are not to scale. (d) A HAR can act as an enhancer. In this case, transcriptional activity and/or expression pattern in tissues are altered. (e) Segmental duplications may result in paralogs with new functions. The paralogs are found in different chromosomal regions. The new paralogs are also often inverted relative to the parental sequence. The fact that the duplications are partial explains differences between the ancestral gene and the paralogs. Additionally, new sequence variants, denoted with a star, may be introduced in paralogs and will result in further alterations of the protein sequence. An example is the splicing variant in the gene ARHGAP11B. Although pseudogenes are often regarded as “dead genes,” there is evidence that these noncoding gens may generate functional RNAs that regulate gene expression (Pink et al. 2011).
<sc>Fig</sc>. 1.
Fig. 1.
—Examples of sequence variants described in the present review. (a–c) Actual examples of a HAR, several duplications, and a deletion. Screen shots from the UCSC Genome Browser are used. The species, genome assembly, chromosomal coordinates, and size of the region in bp are shown. (a) HAR1 is depicted. The coordinates of the region in the human NCBI genome assembly 35 are as defined in Pollard et al. (2006a). The twelve human-specific substitutions are denoted with asterisks. Dots within the alignment denote bases identical in human and other species. (b) The paralog SRGAP2B and the partially duplicated and inverted pseudogene SRGAP2D are pointed by arrows. Neighboring genes FAM72 are depicted; this picture however contradicts results in Dennis et al. (2017): SRGAP2B duplicated within the same genomic fragment as FAM72B and SRGAP2D duplicated within the same genomic fragment as FAM72D. This contradiction illustrates the incompleteness of the current human genome assembly. The depicted gene FAM72C is actually located on chromosome 1p11.2. The gene NBPF15 with primate-specific DUF120 domain amplification, denoted with a star, is also shown in this genomic fragment. The multispecies alignment depicts factual absence of this gene in nonprimate animals. The picture is similar for the 100 vertebrates’ alignment. (c) hCONDEL.122 as defined in McLean et al. (2011). Build 2 Version 1 (Oct 2005) of the chimpanzee genome assembly, produced by the Chimpanzee Sequencing and Analysis Consortium, is shown. The corresponding human sequence contains a gap (the deletion). Other species are shown for comparison. Single lines signify gaps (sequence absent) and double lines signify more complex cases that are more difficult to resolve. In cases where multiple chains of the same species align over a region of the chimpanzee genome, the chains with single-lined gaps are often due to retrotransposed pseudogenes, whereas chains with double-lined gaps are more often due to paralogs and duplicated pseudogenes. (d and e) Generalized examples of functional variants. The examples do not represent particular regulatory sequences or coding genes, but illustrate principles upon which these sequences function, according to studies, described in the present review. The schematics are not to scale. (d) A HAR can act as an enhancer. In this case, transcriptional activity and/or expression pattern in tissues are altered. (e) Segmental duplications may result in paralogs with new functions. The paralogs are found in different chromosomal regions. The new paralogs are also often inverted relative to the parental sequence. The fact that the duplications are partial explains differences between the ancestral gene and the paralogs. Additionally, new sequence variants, denoted with a star, may be introduced in paralogs and will result in further alterations of the protein sequence. An example is the splicing variant in the gene ARHGAP11B. Although pseudogenes are often regarded as “dead genes,” there is evidence that these noncoding gens may generate functional RNAs that regulate gene expression (Pink et al. 2011).
<sc>Fig</sc>. 2.—
Fig. 2.—
The timeline of studies describing human-specific sequence variations, mentioned in the present review.
<sc>Fig</sc>. 3.—
Fig. 3.—
Intersection statistics for the six HAR data sets. (a) A heatmap showing pairwise comparison between data sets. (b) Intersection between data sets. Each data set is represented by a black filled circle. A vertical black line connects the circles to emphasize intersections between corresponding data sets. The number of intersections between HARs is shown as a bar chart. The arrow indicates intersection between the six data sets. Bush HARs from Bush and Lahn (2008), Pollard HARs from Pollard et al. (2006a), Gittelman haDHSs from Gittelman et al. (2015), Lindblad-Toh 2xHARs from Lindblad-Toh et al. (2011), Prabhakar HACNSs from Prabhakar et al. (2006), and Bird ANCs from Bird et al. (2007).

Similar articles

Cited by

References

    1. 1000 Genomes Project Consortium. 2012. An integrated map of genetic variation from 1, 092 human genomes. Nature 491:56–65.http://dx.doi.org/10.1038/nature11632 - DOI - PMC - PubMed
    1. Abdel-Mannan O, Cheung AF, Molnar Z.. 2008. Evolution of cortical neurogenesis. Brain Res Bull. 75(2–4):398–404. - PubMed
    1. Allen NJ. 2012. Astrocyte glypicans 4 and 6 promote formation of excitatory synapses via GluA1 AMPA receptors. Nature 486(7403):410–414. - PMC - PubMed
    1. Antonacci F, et al.2014. Palindromic GOLGA8 core duplicons promote chromosome 15q13.3 microdeletion and evolutionary instability. Nat Genet. 46(12):1293–1302.http://dx.doi.org/10.1038/ng.3120 - DOI - PMC - PubMed
    1. Asahara H, et al.2002. Dual roles of p300 in chromatin assembly and transcriptional activation in cooperation with nucleosome assembly protein 1 in vitro. Mol Cell Biol. 22(9):2974–2983.http://dx.doi.org/10.1128/MCB.22.9.2974-2983.2002 - DOI - PMC - PubMed

Publication types