A phased chromosome-level genome of the annelid tubeworm Galeolaria caespitosa
- PMID: 40269537
- PMCID: PMC12400804
- DOI: 10.1093/jhered/esaf025
A phased chromosome-level genome of the annelid tubeworm Galeolaria caespitosa
Abstract
Haplotype-resolved (phased) genome assemblies are emerging as important assets for genomic studies of species with high heterozygosity, but remain lacking for key animal lineages. Here, we use PacBio HiFi and Omni-C technologies to assemble the first phased, annotated, chromosome-level genome for any annelid: the reef-building tubeworm Galeolaria caespitosa (Serpulidae). The assembly is 803.5 Mbp long (scaffold N50 = 76.5 Mbp) for haplotype 1 and 789.3 Mbp long (scaffold N50 = 75.4 Mbp) for haplotype 2, which are arranged into 11 pairs of chromosomes showing no sign of sex chromosomes. This compares with cytological analyses reporting 12 to 13 pairs in G. caespitosa's closest relatives, including species that are protandrous hermaphrodites. We combined long-read and short-read transcriptome sequencing to annotate both haplotypes, resulting in 30,495 predicted proteins for haplotype 1, 27,423 proteins for haplotype two, and 79.5% of proteins with at least one functional annotation. We also assembled a mitochondrial genome 23 kbp long, annotating all genes typically found in mitochondrial DNA apart from those coding the 16S ribosomal subunit (rrnL) and the protein atp8-a short, fast-evolving mitochondrial gene missing in other metazoans. Comparing G. caespitosa's genome to those of three other annelids reveals limited collinearity despite 36.0% of shared orthologous gene clusters (4,238 of 11,763 clusters counted in G. caespitosa), suggesting extensive chromosomal rearrangements among lineages. New high-quality annelid genomes may help resolve the genetic and evolutionary basis of this diversity.
Keywords: Annelida; Polychaeta; annotated reference genome; ecosystem engineer; marine invertebrate; mitochondria.
© The American Genetic Association. 2025.
Figures
References
-
- Allio R, Schomaker-Bastos A, Romiguier J, Prosdocimi F, Nabholz B, Delsuc F. MitoFinder: Efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics. Mol Ecol Resour. 2020:20:892–905. https://doi.org/ 10.1111/1755-0998.13160 - DOI - PMC - PubMed
-
- Andrade SCS, Novo M, Kawauchi GY, Worsaae K, Pleijel F, Giribet G, Rouse GW. Articulating “Archiannelids”: phylogenomics and annelid relationships, with emphasis on meiofaunal taxa. Mol Biol Evol. 2015:32:2860–2875. https://doi.org/ 10.1093/molbev/msv157 - DOI - PubMed
-
- Arthur, D., & Vassilvitskii, S. k-means++: the advantages of careful seeding. Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, New Orleans, Louisiana; 2007.
-
- Bairoch A. The ENZYME database in 2000. Nucleic Acids Res. 2000:28:304–305. https://doi.org/ 10.1093/nar/28.1.304 - DOI - PMC - PubMed
-
- Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, Pütz J, Middendorf M, Stadler PF. MITOS: improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 2013:69:313–319. https://doi.org/ 10.1016/j.ympev.2012.08.023 - DOI - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
