Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
- PMID: 33910595
- PMCID: PMC8082918
- DOI: 10.1186/s13059-021-02336-9
Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
Abstract
Background: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly.
Results: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100-300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization.
Conclusions: Our results indicate that even in the "simple" case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone.
Keywords: Assembly; Duplications; Long reads; Mitochondrial DNA; Repeats; Sequencing; Vertebrate.
Conflict of interest statement
V. C., S. M., and D. F. are employees of Oxford Nanopore Technologies Limited. J. K. is Chief Scientific Officer of Pacific Biosciences.
Figures





Similar articles
-
Towards complete and error-free genome assemblies of all vertebrate species.Nature. 2021 Apr;592(7856):737-746. doi: 10.1038/s41586-021-03451-0. Epub 2021 Apr 28. Nature. 2021. PMID: 33911273 Free PMC article.
-
Rapid Low-Cost Assembly of the Drosophila melanogaster Reference Genome Using Low-Coverage, Long-Read Sequencing.G3 (Bethesda). 2018 Oct 3;8(10):3143-3154. doi: 10.1534/g3.118.200162. G3 (Bethesda). 2018. PMID: 30018084 Free PMC article.
-
Widespread false gene gains caused by duplication errors in genome assemblies.Genome Biol. 2022 Sep 27;23(1):205. doi: 10.1186/s13059-022-02764-1. Genome Biol. 2022. PMID: 36167596 Free PMC article.
-
Oxford Nanopore MinION Sequencing and Genome Assembly.Genomics Proteomics Bioinformatics. 2016 Oct;14(5):265-279. doi: 10.1016/j.gpb.2016.05.004. Epub 2016 Sep 17. Genomics Proteomics Bioinformatics. 2016. PMID: 27646134 Free PMC article. Review.
-
[Mitogenome assembly strategies and software applications in the genome era].Yi Chuan. 2019 Nov 20;41(11):979-993. doi: 10.16288/j.yczz.19-227. Yi Chuan. 2019. PMID: 31735702 Review. Chinese.
Cited by
-
Insights into phylogenetic relationships and gene rearrangements: complete mitogenomes of two sympatric species in the genus Rana (Anura, Ranidae).Zookeys. 2024 Oct 21;1216:63-82. doi: 10.3897/zookeys.1216.131847. eCollection 2024. Zookeys. 2024. PMID: 39474245 Free PMC article.
-
The genome sequence of the Atlantic horse mackerel, Trachurus trachurus (Linnaeus 1758).Wellcome Open Res. 2022 Mar 31;7:118. doi: 10.12688/wellcomeopenres.17813.1. eCollection 2022. Wellcome Open Res. 2022. PMID: 36874570 Free PMC article.
-
Chromosome-level assembly and annotation of the Xyrichtys novacula (Linnaeus, 1758) genome.DNA Res. 2023 Oct 1;30(5):dsad021. doi: 10.1093/dnares/dsad021. DNA Res. 2023. PMID: 37797305 Free PMC article.
-
An annotated chromosome-scale reference genome for Eastern black-eared wheatear (Oenanthe melanoleuca).G3 (Bethesda). 2023 Jun 1;13(6):jkad088. doi: 10.1093/g3journal/jkad088. G3 (Bethesda). 2023. PMID: 37097035 Free PMC article.
-
Mitochondrial genomes revisited: why do different lineages retain different genes?BMC Biol. 2024 Jan 25;22(1):15. doi: 10.1186/s12915-024-01824-1. BMC Biol. 2024. PMID: 38273274 Free PMC article. Review.
References
-
- Karnkowska A, Vacek V, Zubáčová Z, Treitli SC, Petrželková R, Eme L, Novák L, Žárský V, Barlow LD, Herman EK, Soukal P, Hroudová M, Doležal P, Stairs CW, Roger AJ, Eliáš M, Dacks JB, Vlček Č, Hampl V. A eukaryote without a mitochondrial organelle. Curr Biol. 2016;26(10):1274–1284. doi: 10.1016/j.cub.2016.03.053. - DOI - PubMed
-
- Kolesnikov AA, Gerasimov ES. Diversity of mitochondrial genome organization. Biochemistry. 2012;77:1424–1435. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials