Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan 1;9(1):giz159.
doi: 10.1093/gigascience/giz159.

A genome alignment of 120 mammals highlights ultraconserved element variability and placenta-associated enhancers

Affiliations

A genome alignment of 120 mammals highlights ultraconserved element variability and placenta-associated enhancers

Nikolai Hecker et al. Gigascience. .

Abstract

Background: Multiple alignments of mammalian genomes have been the basis of many comparative genomic studies aiming at annotating genes, detecting regions under evolutionary constraint, and studying genome evolution. A key factor that affects the power of comparative analyses is the number of species included in a genome alignment.

Results: To utilize the increased number of sequenced genomes and to provide an accessible resource for genomic studies, we generated a mammalian genome alignment comprising 120 species. We used this alignment and the CESAR method to provide protein-coding gene annotations for 119 non-human mammals. Furthermore, we illustrate the utility of this alignment by 2 exemplary analyses. First, we quantified how variable ultraconserved elements (UCEs) are among placental mammals. Leveraging the high taxonomic coverage in our alignment, we estimate that UCEs contain on average 4.7%-15.6% variable alignment columns. Furthermore, we show that the center regions of UCEs are generally most constrained. Second, we identified enhancer sequences that are only conserved in placental mammals. We found that these enhancers are significantly associated with placenta-related genes, suggesting that some of these enhancers may be involved in the evolution of placental mammal-specific aspects of the placenta.

Conclusion: The 120-mammal alignment and all other data are available for analysis and visualization in a genome browser at https://genome-public.pks.mpg.de/and for download at https://bds.mpi-cbg.de/hillerlab/120MammalAlignment/.

Keywords: comparative gene annotation; enhancers; genome alignment; mammals; ultraconserved elements.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Phylogeny of 120 mammals included in our alignment and number of annotated genes. Bars visualize the number of human genes for which we projected ≥1 intact exon. Major groups of mammals are indicated. The 57 Laurasiatheria species are shown on the right side for space reasons.
Figure 2:
Figure 2:
Example of a sequence alignment of 120 mammals showing an 88-bp region inside a UCE. This UCE is located in an intron of the DACH1 gene, which encodes a transcription factor important for development. Dots in the 120-mammal alignment refer to bases that are identical to those in the human genome. For space reasons, 25 primates, 13 carnivora, and 10 bats that all have identical sequence to human are not shown. Green and blue fonts indicate species of different clades. The alignment of this ultraconserved region shows that most columns are identical across all 120 mammals but also reveals a few substitutions. Some of these substitutions are species-specific and may be attributed to base errors in the assembly. Other substitutions are shared among independently sequenced genomes of related species (red boxes), which makes base errors very unlikely. We used shared substitutions to calculate a lower bound for the percentage of UCE positions that can vary across placental mammals. We used both shared and species-specific substitutions to calculate an upper bound for this percentage.
Figure 3:
Figure 3:
Variability of UCEs across placental mammals. For each alignment position in the 441 UCEs for which ≥110 placental mammals had aligning sequence in our genome alignment, we examined whether positions in the UCE are identical or were substituted at least once across the 115 non-human placental mammals. (A) Violin and box plots show the distribution of the fraction of variable positions per UCE across placental mammals. The white box spans the first to the third quartile, the middle line indicates the median. In addition to considering all 115 non-human placental mammals, we also determined the fraction of variable positions per UCE considering only 60 non-human placental mammals. This illustrates that analyzing fewer species would underestimate UCE variability. (B) Bar plots show the number of substitutions observed in UCEs with respect to their relative position in UCEs. UCEs were divided into 100 equally sized bins. Both upper and lower bounds show that UCEs are more variable at their flanks than in their center.

References

    1. Miller W, Makova KD, Nekrutenko A, et al. .. Comparative genomics. Annu Rev Genomics Hum Genet. 2004;5:15–56. - PubMed
    1. Alfoldi J, Lindblad-Toh K. Comparative genomics as a tool to understand evolution and disease. Genome Res. 2013;23(7):1063–8. - PMC - PubMed
    1. Stark A, Lin MF, Kheradpour P, et al. .. Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures. Nature. 2007;450(7167):219–32. - PMC - PubMed
    1. Lindblad-Toh K, Garber M, Zuk O, et al. .. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011;478(7370):476–82. - PMC - PubMed
    1. Washietl S, Hofacker IL, Stadler PF. Fast and reliable prediction of noncoding RNAs. Proc Natl Acad Sci U S A. 2005;102(7):2454–9. - PMC - PubMed

Publication types