Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Sep 4;11(9):1046.
doi: 10.3390/genes11091046.

An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice

Affiliations
Review

An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice

Tanguy Lallemand et al. Genes (Basel). .

Abstract

Gene duplication is an important evolutionary mechanism allowing to provide new genetic material and thus opportunities to acquire new gene functions for an organism, with major implications such as speciation events. Various processes are known to allow a gene to be duplicated and different models explain how duplicated genes can be maintained in genomes. Due to their particular importance, the identification of duplicated genes is essential when studying genome evolution but it can still be a challenge due to the various fates duplicated genes can encounter. In this review, we first describe the evolutionary processes allowing the formation of duplicated genes but also describe the various bioinformatic approaches that can be used to identify them in genome sequences. Indeed, these bioinformatic approaches differ according to the underlying duplication mechanism. Hence, understanding the specificity of the duplicated genes of interest is a great asset for tool selection and should be taken into account when exploring a biological question.

Keywords: bioinformatic tools; gene duplication; genome evolution; paralogous genes; synteny.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The different types of duplications. (A) Whole genome duplication which implies complete chromosome duplication. (B) Tandem duplications which produce identical adjacent sequences. (C) Retroduplication, which produces a retrocopy of a gene devoid of introns and with a polyA tail. (D) Transduplication in which a DNA transposon acquires fragments of genes. (E) Segmental duplications which correspond to long stretches of duplicated sequences with high identity.

References

    1. Ohno S. Evolution by Gene Duplication. Springer; Berlin/Heidelberg, Germany: 1970.
    1. Kondrashov F.A. Gene duplication as a mechanism of genomic adaptation to a changing environment. Proc. R. Soc. B Biol. Sci. 2012;279:5048–5057. doi: 10.1098/rspb.2012.1108. - DOI - PMC - PubMed
    1. Van de Peer Y., Maere S., Meyer A. The evolutionary significance of ancient genome duplications. Nat. Rev. Genet. 2009;10:725–732. doi: 10.1038/nrg2600. - DOI - PubMed
    1. Vallejo-Marín M., Buggs R.J.A., Cooley A.M., Puzey J.R. Speciation by genome duplication: Repeated origins and genomic composition of the recently formed allopolyploid species Mimulus peregrinus. Evolution. 2015;69:1487–1500. doi: 10.1111/evo.12678. - DOI - PMC - PubMed
    1. Ting C.T., Tsaur S.C., Sun S., Browne W.E., Chen Y.C., Patel N.H., Wu C.I. Gene duplication and speciation in Drosophila: Evidence from the Odysseus locus. Proc. Natl. Acad. Sci. USA. 2004;101:12232–12235. doi: 10.1073/pnas.0401975101. - DOI - PMC - PubMed

Publication types

LinkOut - more resources