Mind the gap! The mitochondrial control region and its power as a phylogenetic marker in echinoids

Omri Bronstein¹,², Andreas Kroh³, Elisabeth Haring⁴,⁵

Affiliations

¹ Natural History Museum Vienna, Geological-Palaeontological Department, 1010, Vienna, Austria. omribronstein@gmail.com.
² Natural History Museum Vienna, Central Research Laboratories, 1010, Vienna, Austria. omribronstein@gmail.com.
³ Natural History Museum Vienna, Geological-Palaeontological Department, 1010, Vienna, Austria.
⁴ Natural History Museum Vienna, Central Research Laboratories, 1010, Vienna, Austria.
⁵ Department of Integrative Zoology, University of Vienna, Vienna, Austria.

PMID: 29848319
PMCID: PMC5977486
DOI: 10.1186/s12862-018-1198-x


Omri Bronstein et al. BMC Evol Biol. 2018 May 30;18(1):80. doi: 10.1186/s12862-018-1198-x.

Abstract

Background: In Metazoa, mitochondrial markers are the most commonly used targets for inferring species-level molecular phylogenies due to their extremely low rate of recombination, maternal inheritance, ease of use and fast substitution rate in comparison to nuclear DNA. The mitochondrial control region (CR) is the main non-coding area of the mitochondrial genome and contains the mitochondrial origin of replication and transcription. While sequences of the cytochrome oxidase subunit 1 (COI) and 16S rRNA genes are the prime mitochondrial markers in phylogenetic studies, the highly variable CR is typically ignored and not targeted in such analyses. However, the higher substitution rate of the CR can be harnessed to infer the phylogeny of closely related species, and the use of a non-coding region alleviates biases resulting from both directional and purifying selection. Additionally, complete mitochondrial genome assemblies utilizing next generation sequencing (NGS) data often show exceptionally low coverage at specific regions, including the CR. This can only be resolved by targeted sequencing of this region.

Results: Here we provide novel sequence data for the echinoid mitochondrial control region in over 40 species across the echinoid phylogenetic tree. We demonstrate the advantages of directly targeting the CR and adjacent tRNAs to facilitate complementing low coverage NGS data from complete mitochondrial genome assemblies. Finally, we test the performance of this region as a phylogenetic marker both in the lab and in phylogenetic analyses, and demonstrate its superior performance over the other available mitochondrial markers in echinoids.

Conclusions: Our target region of the mitochondrial CR (1) facilitates the first thorough investigation of this region across a wide range of echinoid taxa, (2) provides a tool for complementing missing data in NGS experiments, and (3) identifies the CR as a powerful, novel marker for phylogenetic inference in echinoids due to its high variability, lack of selection, and high compatibility across the entire class, outperforming conventional mitochondrial markers.

Keywords: Control region; Echinoidea; Mitochondrial markers; Molecular phylogeny; NGS; Sea urchins.

Conflict of interest statement

Ethics approval and consent to participate

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

**Fig. 1**
Representation of echinoid complete mitochondrial genomes assembled from NGS data, showing gene annotation and coverage. The annotated genomes are represented by four echinoid species: *Hemicentrotus pulcherrimus*, *Strongylocentrotus fragilis*, *Mesocentrotus franciscanus*, and *Strongylocentrotus intermedius*, corresponding to GenBank accession numbers KC898202, KC898198, KC898199, and KC898200, respectively. Annotations are given at the outer margin of the external circle. Concentric circles represent the corresponding coverage for each of the represented species' mitogenomes. Data were obtained from Kober and Bernardi [86, 87]. The enlarged segment illustrates the position of the various primers used in the current study. Coverage was calculated in BRIG [88], after read mapping with Bowtie2 [89] (using the predefined alignment threshold “very-sensitive”). Annotations are based on those for *H. pulcherrimus* (GenBank accession no. NC_023771), and radial plots were generated using BRIG

**Fig. 2**
Pairwise tree comparisons for phylogenetic trees based on commonly used mitochondrial markers. Trees include the two most commonly used phylogenetic mitochondrial markers: a fragment of the *cytochrome c oxidase subunit 1* (a) gene and a fragment of the *16S ribosomal RNA* (c) as well as the novel tRNAs and control region (e). To facilitate independent comparisons, the genetically inferred trees were restricted to the 35 publicly available complete echinoid mitochondrial genomes. Genera represented by more than one species were collapsed and are depicted by single branches. Supporting values (> 0.85 posterior probabilities and > 75% ML bootstrap values) are shown next to nodes. Topological comparisons between the genetically inferred trees and current classification (b, d, f) (see text for details) were visualised using Phylo.io [62]. Colour scale for the comparison metric (a variant of the Jaccard Index as implemented in Phylo.io) ranges from 0 (subtrees completely different) to 1 (subtree structure of the respective node is identical)
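As a rough illustration of the kind of score behind the colour scale, the following sketch computes a plain Jaccard index between the tip-label sets of two clades and scores a clade by its best match in the other tree. This is an assumption-laden simplification: it is not the exact variant implemented in Phylo.io, and the function names and clade labels are hypothetical.

```python
def jaccard(a, b):
    """Plain Jaccard index |A & B| / |A | B| of two tip-label sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # convention: two empty sets count as identical
    return len(a & b) / len(a | b)


def best_match_score(clade, other_tree_clades):
    """Score a clade by its best Jaccard match among the other tree's clades."""
    return max(jaccard(clade, c) for c in other_tree_clades)


# Hypothetical clades (sets of tip labels); not taken from the paper's trees.
tree_a_clade = {"sp1", "sp2", "sp3"}
tree_b_clades = [{"sp1", "sp2"}, {"sp1", "sp2", "sp3", "sp4"}, {"sp5"}]
score = best_match_score(tree_a_clade, tree_b_clades)
```

A score of 1 means some clade in the other tree contains exactly the same tips; a score of 0 means no clade shares any tip with it.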

**Fig. 3**
Substitution saturation plot of the CRA marker based on the *CRA-All* dataset. The number of transitions (s) and transversions (v) is plotted against F84 genetic distance. A linear correlation is sustained for both transitions and transversions as expected in the absence of saturation
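The quantities on the vertical axis of such a plot can be sketched as follows: for one pair of aligned sequences, count transitions (purine to purine or pyrimidine to pyrimidine) and transversions (purine to pyrimidine or the reverse). This is a minimal sketch, assuming gaps and ambiguity codes are simply skipped. It does not compute the F84 distance, and the function name and example sequences are hypothetical.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_ts_tv(seq1, seq2):
    """Return (transitions, transversions) for two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    ts = tv = 0
    for x, y in zip(seq1.upper(), seq2.upper()):
        # Skip identical sites, gaps, and ambiguity codes (an assumption).
        if x == y or x not in "ACGT" or y not in "ACGT":
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            ts += 1  # A<->G or C<->T
        else:
            tv += 1  # any purine<->pyrimidine change
    return ts, tv


# Hypothetical aligned fragments, for illustration only.
s, v = count_ts_tv("ACGTACGT", "GCGTTCAT")
```

Repeating this for every sequence pair and plotting the counts against the pairwise genetic distance gives the two point clouds; a plateau in transitions at larger distances would indicate saturation.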

**Fig. 4**
Phylogenetic tree reconstruction of the echinoid control region and adjacent areas (CRA). The BI tree presented is based on 86 unique haplotypes retrieved from a total of 110 sequences, 405 bp long (see Table 1 for details on the sequences used for this tree). Supporting values (> 0.5 posterior probabilities and > 50% ML bootstrap values) are shown above the nodes

**Fig. 5**
Coverage (orange curve) and GC content (black curve; 200 bp sliding window, 10 bp step width) through the mitogenome of *Hemicentrotus pulcherrimus* (GenBank accession no. KC898202), illustrating a moderate (R² = 0.335) but highly significant correlation (t-test, p < 10^−100) between the two graphs. Note the extreme drop in coverage towards the end of the CR (highlighted in grey), which coincides with a slight decrease in GC content but shows a much stronger negative excursion than other GC-poor areas in the mitogenome of this species (e.g. at nucleotide positions 4.4, 8.5, or 12.6 kb)
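The GC curve described in this caption can be approximated with a simple sliding window, and the strength of its relationship to coverage with a squared Pearson correlation. The sketch below uses the stated window (200 bp) and step (10 bp). How coverage was summarised per window is not given in the caption, so pairing the two series is left to the reader, and all names here are hypothetical.

```python
def gc_sliding(seq, window=200, step=10):
    """GC fraction in each window of `window` bp, advancing by `step` bp."""
    seq = seq.upper()
    values = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        values.append((chunk.count("G") + chunk.count("C")) / window)
    return values


def r_squared(x, y):
    """Squared Pearson correlation coefficient of two equal-length series."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two series of equal length (>= 2)")
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant series has no defined correlation")
    return (sxy * sxy) / (sxx * syy)


# Toy sequence: a GC-only half followed by an AT-only half.
toy = "GC" * 100 + "AT" * 100
gc = gc_sliding(toy, window=200, step=100)
```

With real data, `r_squared(gc_values, coverage_values)` would be applied to GC and coverage series sampled at the same window positions.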


Cited by

The first complete mitochondrial genome of the sand dollar Sinaechinocyamus mai (Echinoidea: Clypeasteroida).
Lin JP, Tsai MH, Kroh A, Trautman A, Machado DJ, Chang LY, Reid R, Lin KT, Bronstein O, Lee SJ, Janies D. Genomics. 2020 Mar;112(2):1686-1693. doi: 10.1016/j.ygeno.2019.10.007. Epub 2019 Oct 17. PMID: 31629878.
The Detection and Partial Localisation of Heteroplasmic Mutations in the Mitochondrial Genome of Patients with Diabetic Retinopathy.
Malik AN, Rosa HS, de Menezes ES, Tamang P, Hamid Z, Naik A, Parsade CK, Sivaprasad S. Int J Mol Sci. 2019 Dec 11;20(24):6259. doi: 10.3390/ijms20246259. PMID: 31835862.
First Report of Rickettsia conorii in Hyalomma kumari Ticks.
Ullah S, Alouffi A, Almutairi MM, Islam N, Rehman G, Ul Islam Z, Ahmed H, Júnior IDSV, Labruna MB, Tanaka T, Ali A. Animals (Basel). 2023 Apr 27;13(9):1488. doi: 10.3390/ani13091488. PMID: 37174525.
A Comparative Analysis of Mitogenomes in Species of the Tapinoma nigerrimum Complex and Other Species of the Genus Tapinoma (Formicidae, Dolichoderinae).
Ruiz-Mena A, Mora P, Rico-Porras JM, Kaufmann B, Seifert B, Palomeque T, Lorite P. Insects. 2024 Dec 2;15(12):957. doi: 10.3390/insects15120957. PMID: 39769559.
Distant hybrids of Heliocidaris crassispina (♀) and Strongylocentrotus intermedius (♂): identification and mtDNA heteroplasmy analysis.
Zhan Y, Sun J, Li Y, Cui D, Zhang W, Yang L, Chang Y. BMC Evol Biol. 2020 Aug 11;20(1):101. doi: 10.1186/s12862-020-01667-8. PMID: 32781979.


References

1. Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977;74(12):5463–5467. doi: 10.1073/pnas.74.12.5463.
2. Liu L, Li Y, Li S, Hu N, He Y, Pong R, Lin D, Lu L, Law M. Comparison of next-generation sequencing systems. J Biomed Biotechnol. 2012;2012:1–11.
3. Goodwin S, McPherson JD, McCombie WR. Coming of age: ten years of next-generation sequencing technologies. Nat Rev Genet. 2016;17(6):333–351. doi: 10.1038/nrg.2016.49.
4. Wang W, Wei Z, Lam T-W, Wang J. Next generation sequencing has lower sequence coverage and poorer SNP-detection capability in the regulatory regions. Sci Rep. 2011;1:55. doi: 10.1038/srep00055.
5. Ekblom R, Smeds L, Ellegren H. Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria. BMC Genomics. 2014;15(1):467. doi: 10.1186/1471-2164-15-467.
