Review. Plant Mol Biol. 2022 Jun;109(3):177-191.
doi: 10.1007/s11103-020-01104-w. Epub 2021 Feb 18.

Current status and impending progress for cassava structural genomics

Jessica B Lyons et al. Plant Mol Biol. 2022 Jun.

Abstract

We demystify recent advances in genome assemblies for the heterozygous staple crop cassava (Manihot esculenta), and highlight key cassava genomic resources. Cassava, Manihot esculenta Crantz, is a crop of societal and agricultural importance in tropical regions around the world. Genomics provides a platform for accelerated improvement of cassava's nutritional and agronomic traits, as well as for illuminating aspects of cassava's history, including its path towards domestication. The highly heterozygous nature of the cassava genome is widely recognized. However, the full extent and context of this heterozygosity have been difficult to reveal because of technological limitations in genome sequencing. Only recently, with several new long-read sequencing technologies coming online, has the genomics community been able to tackle some similarly difficult genomes. In light of these recent advances, we provide this review to document the current status of the cassava genome and genomic resources and to provide a perspective on what to look forward to in the coming years.

Keywords: Cassava; Crop improvement; Genomics; Heterozygous genomes; Phased genomes.

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Fig. 1
Global cassava yields show potential for improvement. a Global median cassava yield trends (1961–2017), shown with a natural cubic spline-smoothed trendline (blue) and standard errors (shaded ribbon). Cassava yields have risen significantly since the 1980s, possibly due to improvements in germplasm and breeding. More recently, however, yields have plateaued. b The large disparity in yields among cassava-producing regions also suggests that there is still potential for large-scale gains. Yields in each cassava-producing country are plotted relative to the maximum produced in 2017 (32 tons/ha, Lao People’s Democratic Republic). Source: FAOSTAT, December 2019
Fig. 2
Reference genome assembly strategies. a Generating a haploid representation (reference assembly) of a diploid inbred genome is relatively straightforward. Due to homozygosity, sequence reads from the two haplotypes assemble together. b The heterozygosity present in a diploid outbred genome means that sequences from maternal and paternal haplotypes (blue and gold) will tend to assemble separately. In this case, to generate a haploid reference assembly, researchers can either combine maternal and paternal contigs into a haploid representation for each chromosome (haplotype-mosaic reference assembly), or they can try to fully assemble the maternal and paternal chromosomes, choosing one or the other to represent each chromosome in the reference (haplotype-phased reference assembly). Gray, assembly gaps
Fig. 3
Repeats, genes, and recombination frequency in the AM560-2 v7 cassava genome. Repeat density (light blue lines), gene count (blue lines), and recombination rate (gold lines) are plotted. Genic regions are anticorrelated with repetitive regions (Y-axis). Regions with low recombination frequency tend to co-occur with areas of high repeat density, thus, these hard-to-assemble regions also tend not to benefit from scaffolding information provided by a genetic map. Repeat density is measured as the fraction of bases that are annotated as repetitive in 1 Mb sliding windows sampled every 100 kb along the AM560-2 v7 chromosomes. The gene count was also taken with 1 Mb sliding windows every 100 kb. Recombination rate is measured as the number of recombinations per 1 Mb sliding window (100 kb step) using the first derivative of a natural cubic spline-smoothed fit line to the ICGMC 2014 framework map anchored to the v7 genome sequence. The marker positions of the framework map are plotted with vertical black ticks below the X-axis
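
The sliding-window repeat density described in the Fig. 3 caption can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names (`merge_intervals`, `repeat_density`), the half-open coordinate convention, and the choice to skip a trailing partial window are assumptions made for illustration. Only the window size (1 Mb) and step (100 kb) come from the caption.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end) in base pairs (assumed convention)


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or adjacent intervals so no base is counted twice."""
    merged: List[List[int]] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged]


def repeat_density(
    chrom_length: int,
    repeats: List[Interval],
    window: int = 1_000_000,  # 1 Mb window, as in the caption
    step: int = 100_000,      # sampled every 100 kb, as in the caption
) -> List[Tuple[int, float]]:
    """Return (window_start, fraction of window bases annotated as repetitive)."""
    if chrom_length <= 0:
        return []
    merged = merge_intervals(repeats)
    densities = []
    # Only full-size windows are emitted, except when the chromosome is
    # shorter than one window, in which case a single clipped window is used.
    last_start = max(chrom_length - window, 0)
    for w_start in range(0, last_start + 1, step):
        w_end = min(w_start + window, chrom_length)
        covered = sum(
            max(0, min(end, w_end) - max(start, w_start)) for start, end in merged
        )
        densities.append((w_start, covered / (w_end - w_start)))
    return densities


# Toy example: a 2 Mb chromosome with two overlapping repeat annotations.
toy = repeat_density(2_000_000, [(0, 500_000), (400_000, 600_000)])
```

The same windowing could be reused for the gene count by replacing the covered-base sum with a count of genes whose coordinates fall in each window; how genes spanning a window boundary are counted is not stated in the caption and would need to be chosen.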
