Long walk to genomics: History and current approaches to genome sequencing and assembly
- PMID: 31890139
- PMCID: PMC6926122
- DOI: 10.1016/j.csbj.2019.11.002
Long walk to genomics: History and current approaches to genome sequencing and assembly
Abstract
Genomes represent the starting point of genetic studies. Since the discovery of DNA structure, scientists have devoted great efforts to determine their sequence in an exact way. In this review we provide a comprehensive historical background of the improvements in DNA sequencing technologies that have accompanied the major milestones in genome sequencing and assembly, ranging from early sequencing methods to Next-Generation Sequencing platforms. We then focus on the advantages and challenges of the current technologies and approaches, collectively known as Third Generation Sequencing. As these technical advancements have been accompanied by progress in analytical methods, we also review the bioinformatic tools currently employed in de novo genome assembly, as well as some applications of Third Generation Sequencing technologies and high-quality reference genomes.
Keywords: BAC, Bacterial Artificial Chromosome; Bioinformatics; Genome assembly; HGP, Human Genome Project; HMW, high molecular weight; HapMap, haplotype map; NGS, Next Generation Sequencing; Next-generation; OLC, Overlap-Layout-Consensus; QV, Quality Value (QV); Reference; SBS, Sequencing by Synthesis; SMRT, Single Molecule Real-Time; SNPs, Single Nucleotide Polymorphisms; SRA, Short Read Archive; SV, Structural Variant; Sequencing; TGS, Third Generation Sequencing; Third-generation; WGS, Whole Genome Sequencing; ZMW, Zero-Mode Waveguide; bp, base pair; dNTPs, deoxynucleoside triphosphates; ddNTP, 2,3-dideoxynucleoside triphosphate.
© 2019 The Authors.
Conflict of interest statement
G.F. had one travel sponsored by Bionano Genomics. A.M.G., G.R.G. and L.G. declare no conflicts of interest.
Figures




Similar articles
-
Third generation sequencing transforming plant genome research: Current trends and challenges.Gene. 2025 Mar 10;940:149187. doi: 10.1016/j.gene.2024.149187. Epub 2024 Dec 24. Gene. 2025. PMID: 39724994 Review.
-
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8. BMC Genomics. 2016. PMID: 27556636 Free PMC article.
-
Software for pre-processing Illumina next-generation sequencing short read sequences.Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014. Source Code Biol Med. 2014. PMID: 24955109 Free PMC article.
-
Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies.J Vis Exp. 2021 Aug 20;(174). doi: 10.3791/62872. J Vis Exp. 2021. PMID: 34487123
-
From Samples to Germline and Somatic Sequence Variation: A Focus on Next-Generation Sequencing in Melanoma Research.Life (Basel). 2022 Nov 21;12(11):1939. doi: 10.3390/life12111939. Life (Basel). 2022. PMID: 36431075 Free PMC article. Review.
Cited by
-
Current Challenges in Vaccinology.Front Immunol. 2020 Jun 25;11:1181. doi: 10.3389/fimmu.2020.01181. eCollection 2020. Front Immunol. 2020. PMID: 32670279 Free PMC article. Review.
-
Advances in experimental and computational methodologies for the study of microbial-surface interactions at different omics levels.Front Microbiol. 2022 Nov 28;13:1006946. doi: 10.3389/fmicb.2022.1006946. eCollection 2022. Front Microbiol. 2022. PMID: 36519168 Free PMC article. Review.
-
In it for the long run: perspectives on exploiting long-read sequencing in livestock for population scale studies of structural variants.Genet Sel Evol. 2023 Jan 31;55(1):9. doi: 10.1186/s12711-023-00783-5. Genet Sel Evol. 2023. PMID: 36721111 Free PMC article. Review.
-
Full-length transcriptome sequencing provides insights into the evolution of apocarotenoid biosynthesis in Crocus sativus.Comput Struct Biotechnol J. 2020 Mar 26;18:774-783. doi: 10.1016/j.csbj.2020.03.022. eCollection 2020. Comput Struct Biotechnol J. 2020. PMID: 32280432 Free PMC article.
-
Detection accuracy and clinical applications of DP-TOF mass spectrometry.J Int Med Res. 2024 May;52(5):3000605241255568. doi: 10.1177/03000605241255568. J Int Med Res. 2024. PMID: 38819085 Free PMC article.
References
-
- Watson J.D., Crick F.H.C. Molecular structure of nucleic acids. Nature. 1953;171:737–738. - PubMed
-
- Holley R.W., Apgar J., Everett G.A., Madison J.T., Marquisee M. Structure of a ribonucleic acid. Science. 1965;147:1462–1465. - PubMed
-
- Wu R., Kaiser A.D. Structure and base sequence in the cohesive ends of bacteriophage lambda DNA. J Mol Biol. 1968;35:523–537. - PubMed
Publication types
LinkOut - more resources
Full Text Sources
Research Materials