Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 10;11(9):uhae185.
doi: 10.1093/hr/uhae185. eCollection 2024 Sep.

The gap-free genome of Forsythia suspensa illuminates the intricate landscape of centromeres

Affiliations

The gap-free genome of Forsythia suspensa illuminates the intricate landscape of centromeres

Jian Cui et al. Hortic Res. .

Abstract

Forsythia suspensa, commonly known as weeping forsythia, holds significance in traditional medicine and horticulture. Despite its ecological and cultural importance, the existing reference genome presents challenges with duplications and gaps, hindering in-depth genomic analyses. Here, we present a Telomere-to-Telomere (T2T) assembly of the F. suspensa genome, integrating Oxford Nanopore Technologies (ONT) ultra-long, Hi-C datasets, and high-fidelity (HiFi) sequencing data. The T2T reference genome (Fsus-CHAU) consists of 14 chromosomes, totaling 688.79 Mb, and encompasses 33 932 predicted protein-coding genes. Additionally, we characterize functional centromeres in the F. suspensa genome by developing a specific CENH3 antibody. We demonstrate that centromeric regions in F. suspensa exhibit a diverse array of satellites, showcasing distinctive types with unconventional lengths across various chromosomes. This discovery offers implications for the adaptability of CENH3 and the potential influence on centromere dynamics. Furthermore, after assessing the insertion time of full-length LTRs within centromeric regions, we found that they are older compared to those across the entire genome, contrasting with observations in other species where centromeric retrotransposons are typically young. We hypothesize that asexual reproduction may impact retrotransposon dynamics, influencing centromere evolution. In conclusion, our T2T assembly of the F. suspensa genome, accompanied by detailed genomic annotations and centromere analysis, significantly enhances F. suspensa potential as a subject of study in fields ranging from ecology and horticulture to traditional medicine.

PubMed Disclaimer

Conflict of interest statement

No conflict of interest was declared.

Figures

Figure 1
Figure 1
Summary of the Fsus-CHAU genome and its associated phenotypic characteristics. A Circos plot. Starting from the outermost to the innermost ring: (a) chromosome, (b) gene number density, (c) LTR density, (d) GC content, and (e) syntenic block. B The phenotypes of different tissues, including folwer, fruit, stem, leaf, and the whole plant in Forsythia suspensa. C The karyotype analysis of 14 chromosome pairs in F. suspensa. D Chromatin interaction map generated by Hi-C for the Fsus-CHAU genome.
Figure 2
Figure 2
Collinearity between Fsus-CHAU and weeping forsythia. A Structural variations between the Fsus-CHAU and weeping forsythia genome. B Summary of genomic annotation: the ‘Other’ genomic composition encompasses non-protein-coding genes, unclassified repeat regions, and genomic segments that defy annotation as either genes or repeats. C Proportion of different repeat types in the Fsus-CHAU genome.
Figure 3
Figure 3
Genomic mapping of functional centromeres in Forsythia suspensa. A Multiple alignments of CENH3 homologs from different plant lineages including Zea mays, Oryza sativa, Arabidopsis thaliana, Glycine max, and F. suspensa. The peptide used to generate the anti-CENH3 antibody, with the N-terminal tail and the conserved CENP-A Targeting Domain (CATD) marked on the sequence. B Immunostaining of nucleus in the F. suspensa using the anti-CENH3 antibody. Arrows indicate CENH3 signals. Bar = 10 μm. C FISH assay using CENH3-ChIPed DNA as probes. The boxed insets show high-magnification image of the chromosome indicated by the arrow. Chromosomes are stained with DAPI. Bar = 10 μm. D This schematic overview illustrates the results of the CENH3 ChIP-seq assay in F. suspensa. The mapped reads density was plotted in 100-kb windows along the 14 chromosomes. The x-axis shows the positions on each chromosome. The y-axis represents ChIP/input ratio of the normalized density of uniquely mapped reads on each position, which was calculated based on the ratio of reads per million/100.
Figure 4
Figure 4
Characterization of the diverse centromeric satellites in Forsythia suspensa. A Characteristics of centromeres within F. suspensa genome assemblies. The enrichment of CENH3 [log2(ChIP/Input)] depicted across centromere 9. B A circos plot depicting tandem repeats identified by TRASH in de novo mode within the Fsus-CHAU genome assembly. C Dot plots comparing the centromeres using a search window of 100 bp. Forward- and reverse-strand similarity are indicated by different points. D FISH signals of CentFs356, CentFs365, and CentFs733 in F. suspensa. Bar = 10 μm. E A histogram illustrating the lengths (bp) of tandem repeats identified in the Fsus-CHAU genome assembly by TRASH. F Proportion of the genome occupied by different tandem repeats.
Figure 5
Figure 5
The LTR-RTs in the centromeres of Forsythia suspensa. A The distribution of full-length LTR-RTs, fragmented LTRs, and centromeric satellites within the extended 20 Mb region surrounding the centromere of Chromosome 7. B Phylogenetic tree of full-length LTR-RTs from the Fsus-CHAU genome, with branches differentiated by their locations within and outside the centromeres. C Dot plot of centromeric full-length LTR-RTs from the Fsus-CHAU genome using a 100-bp search window. D CENH3 enrichment level [log2(ChIP/Input)] around CentFs satellites, centromeric full-length LTR-RTs (n = 295) and non-centromeric full-length LTR-RTs (n = 3176). E Comparison of the insertion timing between full-length LTR-RTs located within centromeres (Cen_LTR-RTs) and those outside of centromeres (Non-cen_LTR-RTs). F Non-centromeric full-length LTR-RTs (Non-cen_LTR-RTs): the histogram displays the distribution of LTR identity percentages for non-cen_LTR-RTs, with a sample size of 3176 elements. The median LTR identity is 0.982, as indicated by the vertical red line. The x-axis represents the percentage of LTR identity, while the y-axis shows the count of LTR-RTs within each identity range. G Centromeric full-length LTR-RTs (Cen_LTR-RTs): the histogram shows the distribution of LTR identity percentages for centromeric LTR-RTs, with a sample size of 295 elements. The median LTR identity is 0.979, marked by the vertical red line. The x-axis represents the percentage of LTR identity, and the y-axis indicates the count of LTR-RTs within each identity range.

References

    1. Liu Q, Chen X, Guo L. et al. Comparison of the growth adaptation of six Forthysia species in the quaternary red soil of South China. Res Soil Water Conserv. 2020;27:357–69
    1. Ha YH, Kim C, Choi K. et al. Molecular phylogeny and dating of Forsythieae (Oleaceae) provide insight into the Miocene history of Eurasian temperate shrubs. Front Plant Sci. 2018;9:99. - PMC - PubMed
    1. Li LF, Cushman SA, He YX. et al. Genome sequencing and population genomics modeling provide insights into the local adaptation of weeping forsythia. Hortic Res. 2020;7:130. - PMC - PubMed
    1. Li Y, Wang F, Pei N. et al. The updated weeping forsythia genome reveals the genomic basis for the evolution and the forsythin and forsythoside a biosynthesis. Hortic Plant J. 2023;9:1149–61
    1. Liu C, Huang Y, Guo X. et al. Young retrotransposons and non-B DNA structures promote the establishment of dominant rye centromere in the 1RS.1BL fused centromere. New Phytol. 2024;241:607–22 - PubMed

LinkOut - more resources