Many species in one: DNA barcoding overestimates the number of species when nuclear mitochondrial pseudogenes are coamplified
- PMID: 18757756
- PMCID: PMC2527351
- DOI: 10.1073/pnas.0803076105
Many species in one: DNA barcoding overestimates the number of species when nuclear mitochondrial pseudogenes are coamplified
Abstract
Nuclear mitochondrial pseudogenes (numts) are nonfunctional copies of mtDNA in the nucleus that have been found in major clades of eukaryotic organisms. They can be easily coamplified with orthologous mtDNA by using conserved universal primers; however, this is especially problematic for DNA barcoding, which attempts to characterize all living organisms by using a short fragment of the mitochondrial cytochrome c oxidase I (COI) gene. Here, we study the effect of numts on DNA barcoding based on phylogenetic and barcoding analyses of numt and mtDNA sequences in two divergent lineages of arthropods: grasshoppers and crayfish. Single individuals from both organisms have numts of the COI gene, many of which are highly divergent from orthologous mtDNA sequences, and DNA barcoding analysis incorrectly overestimates the number of unique species based on the standard metric of 3% sequence divergence. Removal of numts based on a careful examination of sequence characteristics, including indels, in-frame stop codons, and nucleotide composition, drastically reduces the incorrect inferences of the number of unique species, but even such rigorous quality control measures fail to identify certain numts. We also show that the distribution of numts is lineage-specific and the presence of numts cannot be known a priori. Whereas DNA barcoding strives for rapid and inexpensive generation of molecular species tags, we demonstrate that the presence of COI numts makes this goal difficult to achieve when numts are prevalent and can introduce serious ambiguity into DNA barcoding.
Conflict of interest statement
The authors declare no conflict of interest.
Figures


Similar articles
-
Mitochondrial pseudogenes in the nuclear genome of Aedes aegypti mosquitoes: implications for past and future population genetic studies.BMC Genet. 2009 Mar 6;10:11. doi: 10.1186/1471-2156-10-11. BMC Genet. 2009. PMID: 19267896 Free PMC article.
-
Assessing the effects of primer specificity on eliminating numt coamplification in DNA barcoding: a case study from Orthoptera (Arthropoda: Insecta).Mol Ecol Resour. 2010 Jul;10(4):615-27. doi: 10.1111/j.1755-0998.2009.02823.x. Epub 2010 Jan 3. Mol Ecol Resour. 2010. PMID: 21565066
-
Profile hidden Markov model sequence analysis can help remove putative pseudogenes from DNA barcoding and metabarcoding datasets.BMC Bioinformatics. 2021 May 19;22(1):256. doi: 10.1186/s12859-021-04180-x. BMC Bioinformatics. 2021. PMID: 34011275 Free PMC article.
-
The Mighty NUMT: Mitochondrial DNA Flexing Its Code in the Nuclear Genome.Biomolecules. 2023 Apr 27;13(5):753. doi: 10.3390/biom13050753. Biomolecules. 2023. PMID: 37238623 Free PMC article. Review.
-
[Nuclear mitochondrial pseudogenes].Mol Biol (Mosk). 2010 May-Jun;44(3):405-17. Mol Biol (Mosk). 2010. PMID: 20608164 Review. Russian.
Cited by
-
Frequency matrix approach demonstrates high sequence quality in avian BARCODEs and highlights cryptic pseudogenes.PLoS One. 2012;7(8):e43992. doi: 10.1371/journal.pone.0043992. Epub 2012 Aug 27. PLoS One. 2012. PMID: 22952842 Free PMC article.
-
Environmental genes and genomes: understanding the differences and challenges in the approaches and software for their analyses.Brief Bioinform. 2015 Sep;16(5):745-58. doi: 10.1093/bib/bbv001. Epub 2015 Feb 11. Brief Bioinform. 2015. PMID: 25673291 Free PMC article.
-
The effect of geographical scale of sampling on DNA barcoding.Syst Biol. 2012 Oct;61(5):851-69. doi: 10.1093/sysbio/sys037. Epub 2012 Mar 7. Syst Biol. 2012. PMID: 22398121 Free PMC article.
-
Diversity, Distribution and Host Blood Meal Analysis of Adult Black Flies (Diptera: Simuliidae) from Thailand.Insects. 2024 Jan 21;15(1):74. doi: 10.3390/insects15010074. Insects. 2024. PMID: 38276823 Free PMC article.
-
Comparison of detection methods and genome quality when quantifying nuclear mitochondrial insertions in vertebrate genomes.Front Genet. 2022 Nov 22;13:984513. doi: 10.3389/fgene.2022.984513. eCollection 2022. Front Genet. 2022. PMID: 36482890 Free PMC article.
References
-
- Funk DJ, Omland KE. Species-level paraphyly and polyphyly: Frequency, causes, and consequences, with insights from animal mitochondrial DNA. Annu Rev Ecol Evol Syst. 2003;34:397–423.
-
- Rubinoff D, Cameron S, Will K. A genomic perspective on the shortcomings of mitochondrial DNA for “barcoding” identification. J Hered. 2006;97:581–594. - PubMed
-
- Campbell NJH, Barker SC. The novel mitochondrial gene arrangement of the cattle tick, Boophilus microplus: Fivefold tandem repetition of a coding region. Mol Biol Evol. 1999;16:732–740. - PubMed
-
- Frey JE, Frey B. Origin of intra-individual variation in PCR-amplified mitochondrial cytochrome oxidase I of Thrips tabaci (Thysanoptera: Thripidae): Mitochondrial heteroplasmy or nuclear integration? Hereditas. 2004;140:92–98. - PubMed
Publication types
MeSH terms
Substances
Associated data
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
LinkOut - more resources
Full Text Sources