Comparison of long-read methods for sequencing and assembly of a plant genome
- PMID: 33347571
- PMCID: PMC7751402
- DOI: 10.1093/gigascience/giaa146
Comparison of long-read methods for sequencing and assembly of a plant genome
Abstract
Background: Sequencing technologies have advanced to the point where it is possible to generate high-accuracy, haplotype-resolved, chromosome-scale assemblies. Several long-read sequencing technologies are available, and a growing number of algorithms have been developed to assemble the reads generated by those technologies. When starting a new genome project, it is therefore challenging to select the most cost-effective sequencing technology, as well as the most appropriate software for assembly and polishing. It is thus important to benchmark different approaches applied to the same sample.
Results: Here, we report a comparison of 3 long-read sequencing technologies applied to the de novo assembly of a plant genome, Macadamia jansenii. We have generated sequencing data using Pacific Biosciences (Sequel I), Oxford Nanopore Technologies (PromethION), and BGI (single-tube Long Fragment Read) technologies for the same sample. Several assemblers were benchmarked in the assembly of Pacific Biosciences and Nanopore reads. Results obtained from combining long-read technologies or short-read and long-read technologies are also presented. The assemblies were compared for contiguity, base accuracy, and completeness, as well as sequencing costs and DNA material requirements.
Conclusions: The 3 long-read technologies produced highly contiguous and complete genome assemblies of M. jansenii. At the time of sequencing, the cost associated with each method was significantly different, but continuous improvements in technologies have resulted in greater accuracy, increased throughput, and reduced costs. We propose updating this comparison regularly with reports on significant iterations of the sequencing technologies.
Keywords: BGI; ONT; Oxford Nanopore Technologies; PacBio; Pacific Biosciences; PromethION; Sequel; assembly; long reads; single-tube long fragment read; stLFR.
© The Author(s) 2020. Published by Oxford University Press GigaScience.
Conflict of interest statement
Employees of BGI (W.T., I.H., Q.Y., B.Y., O.W., M.X, P.W.), MGI (H.W.), and Complete Genomics (E.A., Q.M., R.D., B.A.P.) have stock holdings in BGI. The authors declare that they have no other competing interests.
Figures
Comment in
-
Improvements in the sequencing and assembly of plant genomes.GigaByte. 2021 Jun 10;2021:gigabyte24. doi: 10.46471/gigabyte.24. eCollection 2021. GigaByte. 2021. PMID: 36824328 Free PMC article.
References
-
- Gross C, Weston P. Macadamia jansenii (Proteaceae), a new species from central Queensland. Aust Syst Bot. 1992;5(6):725–8.
-
- The four macadamias. http://www.wildmacadamias.org.au/the-four-macadamias. Accessed 14 February 2020.
-
- Chase MW. Relationships between the families of flowering plants. In: Henry RJ, ed., Plant Diversity and Evolution: Genotypic and Phenotypic Variation in Higher Plants. Wallingford, UK; Cambridge, MA: CABI; 2005.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
