Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 30:2022:gigabyte64.
doi: 10.46471/gigabyte.64. eCollection 2022.

Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes

Affiliations

Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes

Akito Y Kawahara et al. GigaByte. .

Abstract

Insect silk is a versatile biomaterial. Lepidoptera and Trichoptera display some of the most diverse uses of silk, with varying strength, adhesive qualities, and elastic properties. Silk fibroin genes are long (>20 Kbp), with many repetitive motifs that make them challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (Plodia interpunctella) and genomic sequences for the caddisfly Eubasilissa regina. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO complete = 99.3%/95.2%), with complete and contiguous recovery of silk heavy fibroin gene sequences. We show that HiFi long-read sequencing is helpful for understanding genes with long, repetitive regions.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1.
Figure 1.
Length of assembled heavy fibroin (HFib) gene for Plodia interpunctella using two approaches. Top shows HiFi assembly, bottom shows Illumina assembly. In the HiFi genome, we recovered the entire length of the sequence, but in the Illumina assembly we could not assemble the genome through the repetitive region.
Figure 2.
Figure 2.
Schematic of the identity and ordering of repeat motifs in Plodia interpunctella. On the right panel are the repetitive units with the N-terminus at the beginning and the C-terminus at the end. The numbers in parentheses refer to the number of times that particular motif is repeated across the gene. The color corresponds with the ordering of the repeats shown on the left. The gene is split into two panels, starting in the left panel and continuing in the right panel. “X” indicates a variable site.
Figure 3.
Figure 3.
Schematic of the identity and ordering of repeat motifs in Eubasilissa regina. On the right panel are the repetitive units with the N-terminus at the beginning and the C-terminus at the end. The numbers in parentheses refer to the number of times that particular motif is repeated across the gene. The color corresponds with the ordering of the repeats shown on the left. The gene is split into two panels, starting in the left panel and continuing in the right panel.
Figure 4.
Figure 4.
BlobPlot for Eubasilissa regina.
Figure 5.
Figure 5.
BlobPlot for Plodia interpunctella.

References

    1. Numata K. . How to define and study structural proteins as biopolymer materials. Polym. J., 2020; 52: 1043–1056. doi: 10.1038/s41428-020-0362-5. - DOI
    1. Davies PL, Hew CL. . Biochemistry of fish antifreeze proteins. FASEB J., 1990; 4: 2460–2468. doi: 10.1096/fasebj.4.8.2185972. - DOI - PubMed
    1. Kono N, Nakamura H, Ohtoshi R et al. The bagworm genome reveals a unique fibroin gene that provides high tensile strength. Commun. Biol., 2019; 2: 1–9. doi: 10.1038/s42003-019-0412-8. - DOI - PMC - PubMed
    1. Stewart RJ, Wang CS. . Adaptation of caddisfly larval silks to aquatic habitats by phosphorylation of H-fibroin serines. Biomacromolecules, 2010; 11: 969–974. doi: 10.1021/bm901426d. - DOI - PubMed
    1. Ashton NN, Roe DR, Weiss RB et al. Self-tensioning aquatic caddisfly silk: Ca2+-dependent structure, strength, and load cycle hysteresis. Biomacromolecules, 2013; 14: 3668–3681. doi: 10.1021/bm401036z. - DOI - PubMed