CGC1, a new reference genome for Caenorhabditis elegans
- PMID: 40664475
- PMCID: PMC12315879
- DOI: 10.1101/gr.280274.124
Abstract
The original 100.3 Mb reference genome for Caenorhabditis elegans, generated from the wild-type laboratory strain N2, has been crucial for analysis of C. elegans since 1998 and has been considered complete since 2005. Unexpectedly, this long-standing reference was shown to be incomplete in 2019 by a genome assembly from the N2-derived strain VC2010. Moreover, genetically divergent versions of N2 have arisen over decades of research and hindered reproducibility of C. elegans genetics and genomics. Here we provide a 106.4 Mb gap-free, telomere-to-telomere genome assembly of C. elegans, generated from CGC1, an isogenic derivative of the N2 strain. We use improved long-read sequencing and manual assembly of 43 recalcitrant genomic regions to overcome deficiencies of prior N2 and VC2010 assemblies and to assemble tandem repeat loci, including a 772 kb sequence for the 45S rRNA genes. Although many differences from earlier assemblies come from repeat regions, unique additions to the genome are also found. Of 19,972 protein-coding genes in the N2 assembly, 19,790 (99.1%) encode products that are unchanged in the CGC1 assembly. The CGC1 assembly also may encode 183 new protein-coding and 163 new ncRNA genes. CGC1 thus provides both a completely defined reference genome and corresponding isogenic wild-type strain for C. elegans, allowing unique opportunities for model and systems biology.
© 2025 Ichikawa et al.; Published by Cold Spring Harbor Laboratory Press.
Update of
- CGC1, a new reference genome for Caenorhabditis elegans. bioRxiv [Preprint]. 2024 Dec 6:2024.12.04.626850. doi: 10.1101/2024.12.04.626850. Update in: Genome Res. 2025 Aug 1;35(8):1902-1918. doi: 10.1101/gr.280274.124. PMID: 39677790.