Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 23;116(5):702-712.
doi: 10.1093/jhered/esaf025.

A phased chromosome-level genome of the annelid tubeworm Galeolaria caespitosa

Affiliations

A phased chromosome-level genome of the annelid tubeworm Galeolaria caespitosa

Monique van Dorssen et al. J Hered. .

Abstract

Haplotype-resolved (phased) genome assemblies are emerging as important assets for genomic studies of species with high heterozygosity, but remain lacking for key animal lineages. Here, we use PacBio HiFi and Omni-C technologies to assemble the first phased, annotated, chromosome-level genome for any annelid: the reef-building tubeworm Galeolaria caespitosa (Serpulidae). The assembly is 803.5 Mbp long (scaffold N50 = 76.5 Mbp) for haplotype 1 and 789.3 Mbp long (scaffold N50 = 75.4 Mbp) for haplotype 2, which are arranged into 11 pairs of chromosomes showing no sign of sex chromosomes. This compares with cytological analyses reporting 12 to 13 pairs in G. caespitosa's closest relatives, including species that are protandrous hermaphrodites. We combined long-read and short-read transcriptome sequencing to annotate both haplotypes, resulting in 30,495 predicted proteins for haplotype 1, 27,423 proteins for haplotype two, and 79.5% of proteins with at least one functional annotation. We also assembled a mitochondrial genome 23 kbp long, annotating all genes typically found in mitochondrial DNA apart from those coding the 16S ribosomal subunit (rrnL) and the protein atp8-a short, fast-evolving mitochondrial gene missing in other metazoans. Comparing G. caespitosa's genome to those of three other annelids reveals limited collinearity despite 36.0% of shared orthologous gene clusters (4,238 of 11,763 clusters counted in G. caespitosa), suggesting extensive chromosomal rearrangements among lineages. New high-quality annelid genomes may help resolve the genetic and evolutionary basis of this diversity.

Keywords: Annelida; Polychaeta; annotated reference genome; ecosystem engineer; marine invertebrate; mitochondria.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Circos plot of the 11 chromosomes in the phased assembly. From outside to inside: chromosomes of haplotype 1 (blue) and haplotype 2 (green), GC%, gene density (red), repeat density (blue), and Galeolaria  caespitosa adults retracted into tubes at low tide (photo credit: E. Chirgwin). GC%, gene density, and repeat density were estimated in 500 Kb windows with a 100 Kb step.
Fig. 2.
Fig. 2.
Comparative analyses of annelid genomes. (B) Counts of orthologous gene clusters shared between, Streblospio benedicti, Helobdella robusta, and Capitella teleta. (C) Collinearity analysis of Helobdella robusta, Streblospio benedicti, and Galeolaria caespitosa genomes. Note that no synteny was detected between genomes of H. robusta and S. benedicti. Genomes are labelled by chromosome number except in the case of H. robusta, which is labelled by scaffold.

References

    1. Allio R, Schomaker-Bastos A, Romiguier J, Prosdocimi F, Nabholz B, Delsuc F. MitoFinder: Efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics. Mol Ecol Resour. 2020:20:892–905. https://doi.org/ 10.1111/1755-0998.13160 - DOI - PMC - PubMed
    1. Andrade SCS, Novo M, Kawauchi GY, Worsaae K, Pleijel F, Giribet G, Rouse GW. Articulating “Archiannelids”: phylogenomics and annelid relationships, with emphasis on meiofaunal taxa. Mol Biol Evol. 2015:32:2860–2875. https://doi.org/ 10.1093/molbev/msv157 - DOI - PubMed
    1. Arthur, D., & Vassilvitskii, S. k-means++: the advantages of careful seeding. Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, New Orleans, Louisiana; 2007.
    1. Bairoch A. The ENZYME database in 2000. Nucleic Acids Res. 2000:28:304–305. https://doi.org/ 10.1093/nar/28.1.304 - DOI - PMC - PubMed
    1. Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, Pütz J, Middendorf M, Stadler PF. MITOS: improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 2013:69:313–319. https://doi.org/ 10.1016/j.ympev.2012.08.023 - DOI - PubMed

LinkOut - more resources