Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Nov 25;12(11):e1002005.
doi: 10.1371/journal.pbio.1002005. eCollection 2014 Nov.

The first myriapod genome sequence reveals conservative arthropod gene content and genome organisation in the centipede Strigamia maritima

Ariel D Chipman  1 David E K Ferrier  2 Carlo Brena  3 Jiaxin Qu  4 Daniel S T Hughes  5 Reinhard Schröder  6 Montserrat Torres-Oliva  3 Nadia Znassi  3 Huaiyang Jiang  4 Francisca C Almeida  7 Claudio R Alonso  8 Zivkos Apostolou  9 Peshtewani Aqrawi  4 Wallace Arthur  10 Jennifer C J Barna  11 Kerstin P Blankenburg  4 Daniela Brites  12 Salvador Capella-Gutiérrez  13 Marcus Coyle  4 Peter K Dearden  14 Louis Du Pasquier  15 Elizabeth J Duncan  14 Dieter Ebert  15 Cornelius Eibner  10 Galina Erikson  16 Peter D Evans  17 Cassandra G Extavour  18 Liezl Francisco  4 Toni Gabaldón  19 William J Gillis  20 Elizabeth A Goodwin-Horn  21 Jack E Green  3 Sam Griffiths-Jones  22 Cornelis J P Grimmelikhuijzen  23 Sai Gubbala  4 Roderic Guigó  24 Yi Han  4 Frank Hauser  23 Paul Havlak  25 Luke Hayden  10 Sophie Helbing  26 Michael Holder  4 Jerome H L Hui  27 Julia P Hunn  28 Vera S Hunnekuhl  3 LaRonda Jackson  4 Mehwish Javaid  4 Shalini N Jhangiani  4 Francis M Jiggins  29 Tamsin E Jones  18 Tobias S Kaiser  30 Divya Kalra  4 Nathan J Kenny  27 Viktoriya Korchina  4 Christie L Kovar  4 F Bernhard Kraus  31 François Lapraz  32 Sandra L Lee  4 Jie Lv  25 Christigale Mandapat  4 Gerard Manning  33 Marco Mariotti  24 Robert Mata  4 Tittu Mathew  4 Tobias Neumann  34 Irene Newsham  4 Dinh N Ngo  4 Maria Ninova  22 Geoffrey Okwuonu  4 Fiona Ongeri  4 William J Palmer  29 Shobha Patil  4 Pedro Patraquim  8 Christopher Pham  4 Ling-Ling Pu  4 Nicholas H Putman  25 Catherine Rabouille  35 Olivia Mendivil Ramos  2 Adelaide C Rhodes  36 Helen E Robertson  32 Hugh M Robertson  37 Matthew Ronshaugen  22 Julio Rozas  38 Nehad Saada  4 Alejandro Sánchez-Gracia  38 Steven E Scherer  4 Andrew M Schurko  21 Kenneth W Siggens  3 DeNard Simmons  4 Anna Stief  39 Eckart Stolle  26 Maximilian J Telford  32 Kristin Tessmar-Raible  40 Rebecca Thornton  4 Maurijn van der Zee  41 Arndt von Haeseler  42 James M Williams  21 Judith H Willis  43 Yuanqing Wu  4 Xiaoyan Zou  4 Daniel Lawson  5 Donna M Muzny  4 Kim C Worley  4 Richard A Gibbs  4 Michael Akam  3 Stephen Richards  4
Affiliations

The first myriapod genome sequence reveals conservative arthropod gene content and genome organisation in the centipede Strigamia maritima

Ariel D Chipman et al. PLoS Biol. .

Abstract

Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific life history.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The phylogenetic position of the centipedes (Chilopoda), with respect to other arthropods, according to the currently best-supported phylogeny.
(See text for details). The four traditionally accepted arthropod classes are marked in bold.
Figure 2
Figure 2. Plot showing that DNA from a male individual contains a distinct fraction of scaffolds that is underrepresented (black arrow), and presumably derives from heterogametic sex chromosomes.
No such fraction is present in the sequenced DNA of two individual females. The data underlying this plot is presented in File S4.
Figure 3
Figure 3. Conserved macro synteny signal between S. maritima and the chordate lancelet B. floridae clustered into ancestral linkage groups.
Each dot represents a pair of genes, one in B. floridae, one in S. maritima, assigned to the same gene family by our orthology analysis. The ancestral linkage group identifiers refer to groups of scaffolds from the S. maritima (SmALG) or B. floridae (BfALG) assemblies, as detailed in File S2. The identification of ALGs is described in the SI. Note that two S. maritima scaffolds were divided across ALGs, and so appear multiple times in File S2.
Figure 4
Figure 4. Homeobox gene clusters.
(A) The Hox gene cluster of S. maritima compared to the cluster that can be deduced for the ancestral arthropod. S. maritima provides the first instance of an arthropod Hox cluster with tight linkage of an Even-skipped (Eve) gene (see text). Hox3 is the only gene missing from the S. maritima Hox cluster, but may be present elsewhere in the genome on a separate scaffold (see main text and Text S1 for details). The S. maritima cluster is drawn approximately to scale and spans 457 kb from the start codon of labial (lab) to the start codon of Eve-b. Arrows denote the transcriptional orientation. (B) Remains of clustering and linkage of ANTP class genes in S. maritima. The blue boxes are genes belonging to the ANTP class. The brown box is a gene belonging to the HNF class. The orange box is a gene belonging to the SINE class. The intergenic distances are indicated in kb. (C) Clusters of non-ANTP class homeobox genes in S. maritima. The green boxes are genes belonging to the TALE class. The red boxes are genes belonging to the PRD class. The intergenic distances are indicated in kb, except in the case of Rx-Hbn as these genes are overlapping but with opposite transcriptional orientations. All scaffold numbers are indicated in brackets.
Figure 5
Figure 5. Expansion of chemosensory receptor families.
(A) Phylogenetic relationships among S. maritima (Smar), I. scapularis (Isca), D. pulex (Dpul), and a few insect GRs that encode for sugar, fructose, and carbon dioxide receptors (Dmel, D. melanogaster, and Amel, A. mellifera). (B) Phylogenetic relationships among S. maritima, I. scapularis, and a few D. melanogaster IRs and IgluR genes (the suffix at the end of the protein names indicates: i, incomplete and p, pseudogene).
Figure 6
Figure 6. Ancestral protein kinases are extensively lost during arthropod evolution.
S. maritima is an exception and retains the largest number of ancestral kinases. Numbers of kinase subfamilies in selected species are shown in parentheses after species names. The gains, losses, and inferred content of common ancestors are listed on internal branches. Kinases found in at least two species from human, C. elegans and Nematostella vectenesis were used as an outgroup.
Figure 7
Figure 7. Presence and absence of immunity genes in different arthropods.
Counts of immune genes are shown for S. maritima, D. pulex , A. mellifera , T. castaneum, Anopheles gambiae, and D. melanogaster . ∼, identity of the gene is uncertain; -, not investigated.
Figure 8
Figure 8. Dscam diversity caused either by gene and/or exon duplication in different Metazoa.
aOnly canonical Dscam paralogues were considered. bIn D. melanogaster and D. pulex the paralogue Dscam-L2 has two Ig7 alternative coding exons. cPotential number of Dscam isoforms, circulating in one individual, produced by mutually exclusive alternative splicing of duplicated exons.
Figure 9
Figure 9. Frequency histogram of CpG(o/e) observed in S. maritima gene bodies.
The y-axis depicts the number of genes with the specific CpG(o/e) values given on the x-axis. The distribution of CpG(o/e) in S. maritima is a trimodal distribution, with a low-CpG(o/e) peak consistent with the presence of historical DNA methylation in S. maritima and the presence of a high CpG(o/e) peak. The data underlying this plot are available in File S4.
Figure 10
Figure 10. Arthropod phylogenetic tree (with nematode outgroup) showing selected events of gene loss, gene gain, and gene family expansions.
Main taxa are listed on the tips, with representative species for which there is a fully sequenced genome listed below. Major nodes are also named. Data from the genome of S. maritima allow us to map when in arthropod evolution these events occurred, even when these events did not occur on the centipede lineage. A plausible node for the occurrence of each event is marked and colour-coded, with the possible range marked with a thin line of the same colour. The events, listed from left to right are: (1) Dscam alternative splicing as a strategy for increasing immune diversity is known from D. melanogaster, as well as the crustacean D. pulex, and thus probably evolved in the lineage leading to pancrustacea, after the split from centipedes. (2) Several wnt genes have been lost in holometabolous insects, leaving only seven of the 13 ancestral families. This loss occurred gradually over arthropod evolution, but reached its peak at the base of the Holometabola. (3) Selenoproteins are rare in insects. The presence of a large number of selenoproteins in S. maritima as well as in other non-insect arthropods suggests that the loss of many selenoproteins occurred at the base of the Insecta. (4) Expansion of chemosensory gene families occurred independently in different arthropod lineages as they underwent terrestrialisation. The OR family is expanded in insects only. (5) Chemosensory genes of the GR and IR genes have undergone a lineage specific expansion in the genome of S. maritima. As these are probably also linked with terrestrialisation we suggest that this expansion happened at the base of the Chilopoda, but it could have also occurred later in the lineage leading to S. maritima. (6) Cuticular proteins of the RR-1 family are numerous in the S. maritima genome. They are found in other arthropods, but not in chelicerates nor in any non-arthropod species. This suggests that the RR-1 subfamily evolved at the base of the Mandibulata. (7) The genome of S. maritima has a large complement of wnt genes, but is missing wnt8. Since this gene is found in the Diplopod G. marginata (a species without a fully sequenced genome), the loss most likely occurred at the base of the Chilopoda. (8) Unlike the situation in D. melanogaster, immune diversity in the S. maritima genome is achieved through multiple copies of the Dscam gene. This expansion of the family could have happened at any time after the split between Myriapoda and Pancrustacea. (9) Both circadian rhythm genes and many light receptors are missing in S. maritima. These losses are most likely due to the subterranean life style of geophilomorph centipedes and are probably specific to this group. However, we cannot rule out the possibility that they were lost somewhere in the lineage leading to myriapods. (10) The existence of JH signalling in S. maritima as well as in all other arthropods studied to date strengthens the idea that this signalling system evolved with the exoskeleton of arthropods, though its origins could be even more ancient and date back to the origin of moulting at the base of the Ecdysozoa.

Comment in

  • What goes "99-thump?".
    Roberts RG. Roberts RG. PLoS Biol. 2014 Nov 25;12(11):e1002006. doi: 10.1371/journal.pbio.1002006. eCollection 2014 Nov. PLoS Biol. 2014. PMID: 25422891 Free PMC article. No abstract available.
  • Genome evolution: groping in the soil interstices.
    Minelli A. Minelli A. Curr Biol. 2015 Mar 2;25(5):R194-6. doi: 10.1016/j.cub.2015.01.018. Curr Biol. 2015. PMID: 25734267

References

    1. Arthropod Genomes Consortium (2014) List of sequenced arthropod genomes. Available: http://arthropodgenomes.org/wiki/Sequenced_genomes.
    1. Bracken-Grissom H, Collins AG, Collins T, Crandall K, Distel D, et al. (2014) The Global Invertebrate Genomics Alliance (GIGA): developing community resources to study diverse invertebrate genomes. J Hered 105: 1–18. - PMC - PubMed
    1. Edgecombe GD (2011) Phylogenetic relationships of Myriapoda. Minelli A, editor. The Myriapoda. Leiden: Brill. pp. 1–20.
    1. Giribet G, Edgecombe GD, Wheeler WC (2001) Arthropod phylogeny based on eight molecular loci and morphology. Nature 157–160. - PubMed
    1. Rota-Stabelli O, Telford MJ (2008) A multi criterion approach for the selection of optimal outgroups in phylogeny: recovering some support for Mandibulata over Myriochelata using mitogenomics. Mol Phylogenet Evol 48: 103–111. - PubMed

Publication types

MeSH terms