Genome assembly in the telomere-to-telomere era
- PMID: 38649458
- DOI: 10.1038/s41576-024-00718-w
Genome assembly in the telomere-to-telomere era
Abstract
Genome sequences largely determine the biology and encode the history of an organism, and de novo assembly - the process of reconstructing the genome sequence of an organism from sequencing reads - has been a central problem in bioinformatics for four decades. Until recently, genomes were typically assembled into fragments of a few megabases at best, but now technological advances in long-read sequencing enable the near-complete assembly of each chromosome - also known as telomere-to-telomere assembly - for many organisms. Here, we review recent progress on assembly algorithms and protocols, with a focus on how to derive near-telomere-to-telomere assemblies. We also discuss the additional developments that will be required to resolve remaining assembly gaps and to assemble non-diploid genomes.
© 2024. Springer Nature Limited.
Update of
-
Genome assembly in the telomere-to-telomere era.ArXiv [Preprint]. 2023 Aug 15:arXiv:2308.07877v1. ArXiv. 2023. Update in: Nat Rev Genet. 2024 Sep;25(9):658-670. doi: 10.1038/s41576-024-00718-w. PMID: 37645045 Free PMC article. Updated. Preprint.
References
-
- C. elegans Sequencing Consortium. Genome sequence of the nematode C. elegans: a platform for investigating biology. Science 282, 2012–2018 (1998). - DOI
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
