Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 4;22(1):794.
doi: 10.1186/s12864-021-08066-3.

During evolution from the earliest tetrapoda, newly-recruited genes are increasingly paralogues of existing genes and distribute non-randomly among the chromosomes

Affiliations

During evolution from the earliest tetrapoda, newly-recruited genes are increasingly paralogues of existing genes and distribute non-randomly among the chromosomes

Wilfred D Stein et al. BMC Genomics. .

Abstract

Background: The present availability of full genome sequences of a broad range of animal species across the whole range of evolutionary history enables one to ask questions as to the distribution of genes across the chromosomes. Do newly recruited genes, as new clades emerge, distribute at random or at non-random locations?

Results: We extracted values for the ages of the human genes and for their current chromosome locations, from published sources. A quantitative analysis showed that the distribution of newly-added genes among and within the chromosomes appears to be increasingly non-random if one observes animals along the evolutionary series from the precursors of the tetrapoda through to the great apes, whereas the oldest genes are randomly distributed.

Conclusions: Randomization will result from chromosome evolution, but less and less time is available for this process as evolution proceeds. Much of the bunching of recently-added genes arises from new gene formation as paralogues in gene families, near the location of genes that were recruited in the preceding phylostratum. As examples we cite the KRTAP, ZNF, OR and some minor gene families. We show that bunching can also result from the evolution of the chromosomes themselves when, as for the KRTAP genes, blocks of genes that had previously been on disparate chromosomes become linked together.

Keywords: Chromosomes; Gene ages; Gene distribution; Gene evolution; Newly-recruited genes; Paralogues; Phylostratigraphy.

PubMed Disclaimer

Conflict of interest statement

Not applicable.

Figures

Fig. 1
Fig. 1
Distribution of newly recruited genes across the autosomal human chromosomes for phylostrata 19.2 (left) and 12 (middle) and for the zebrafish chromosomes for phylostratum 12 (right). The data are presented as the ratio of the content of the genes from the respective phylostratum to the gene content of the whole chromosome (as %), divided by the median of each data set and arranged in order of increasing gene content
Fig. 2
Fig. 2
The distribution of newly recruited genes across the autosomal chromosomes of H. sapiens, measured as MAD values (see Methods) as a function of phylostratum number. The open circles in each figure show the data for the phylostratum numbers found as the modal values for the 13 ortholog search engines studied. The red circles in A show the data taken, for every gene, from the lowest estimate among the 13 sources while the blue circles show these for the highest estimate among the 3 sources. In B, the red circles show the data obtained as the median of those obtained for those three sources that, over the entire genome gave the lowest age estimates, while the blue circles were from the three sources that similarly gave the highest overall age estimates
Fig. 3
Fig. 3
Distribution of newly recruited genes across the autosomal human chromosomes. A: All the genes assigned to phylostrata 15 through 19 by a majority of the 13 ortholog search engines. B: Those similarly assigned to phylostrata 1 through 13. The data are presented as the ratio of the total number of the genes from the respective phylostratum grouping to the gene content of the whole chromosome (as %), and arranged in order of increasing % gene content. In part A, chromosome 19 is on the extreme right, whereas in part B it is on the extreme left
Fig. 4
Fig. 4
Regression of the content of (A) “young” genes (those assigned to phylostrata 15 through 19 or (B) “old” genes (those assigned to phylostrata 1 through 13) against the GC content of the chromosomes. The red-encircled points in both figures depict chromosome 5 while the blue-encircled points show chromosome 19
Fig. 5
Fig. 5
The distribution of newly recruited genes across the autosomal chromosomes, measured as MAD values – see Methods - as a function of phylostratum number, for eight animal species. The horizontal lines drawn are the median, and the 25% and the 75% limits, computed for all the data through to phylostratum 12, the euteleostomii (the jawed fish)
Fig. 6
Fig. 6
Heat maps of the distribution across the 20 sections (the columns) of human chromosome 19 of all its genes (upper figure) and of only its Zinc Finger (ZNF) genes (lower figure), the rows of the map being data for each numbered phylostratum in the 20 sections. For the full genome, the numbers are the percentage of all the genes of the phylostratum denoted, while for the ZNF data the denominator is the number of ZNF genes in the denoted phylostratum
Fig. 7
Fig. 7
(A) The Zinc Finger (ZNF) genes of the human genome as a function of phylostratum age (left) or as their chromosome location (right); (B) similarly for the Keratin-Associated Protein (KRTAP) genes; (C) similarly again for the Olfactory Receptor (OR) genes

References

    1. Domazet-Loso T, Brajković JTD. A phylostratigraphy approach to uncover the genomic history of major adaptations in metazoan lineages. Trends Genet. 2007;23(11):533–539. doi: 10.1016/j.tig.2007.08.014. - DOI - PubMed
    1. Domazet-Loso T, Tautz D. Phylostratigraphic tracking of cancer genes suggests a link to the emergence of multicellularity in metazoa. BMC Biol. 2010;8(1):66. doi: 10.1186/1741-7007-8-66. - DOI - PMC - PubMed
    1. Sestak MS, Domazet-Loso T. Phylostratigraphic profiles in zebrafish uncover chordate origins of the vertebrate brain. Mol Biol Evol. 2015;32(2):299–312. doi: 10.1093/molbev/msu319. - DOI - PMC - PubMed
    1. Arendsee Z, Li J, Singh U, Seetharam A, Dorman K, Wurtele ES. Phylostratr: a framework for phylostratigraphy. Bioinformatics. 2019;35(19):3517–3627. doi: 10.1093/bioinformatics/btz171. - DOI - PubMed
    1. Litman T, Stein WD. Obtaining estimates for the ages of all the protein-coding genes and most of the ontology-identified noncoding genes of the human genome, assigned to 19 phylostrata. Semin Oncol. 2018;46(1):3–9. doi: 10.1053/j.seminoncol.2018.11.002. - DOI - PubMed