Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 7;11(8):jkab150.
doi: 10.1093/g3journal/jkab150.

Sequencing, assembly and annotation of the whole-insect genome of Lymantria dispar dispar, the European gypsy moth

Affiliations

Sequencing, assembly and annotation of the whole-insect genome of Lymantria dispar dispar, the European gypsy moth

Michael E Sparks et al. G3 (Bethesda). .

Abstract

The European gypsy moth, Lymantria dispar dispar (LDD), is an invasive insect and a threat to urban trees, forests and forest-related industries in North America. For use as a comparator with a previously published genome based on the LD652 pupal ovary-derived cell line, as well as whole-insect genome sequences obtained from the Asian gypsy moth subspecies L. dispar asiatica and L. dispar japonica, the whole-insect LDD genome was sequenced, assembled and annotated. The resulting assembly was 998 Mb in size, with a contig N50 of 662 Kb and a GC content of 38.8%. Long interspersed nuclear elements constitute 25.4% of the whole-insect genome, and a total of 11,901 genes predicted by automated gene finding encoded proteins exhibiting homology with reference sequences in the NCBI NR and/or UniProtKB databases at the most stringent similarity cutoff level (i.e., the gold tier). These results will be especially useful in developing a better understanding of the biology and population genetics of L. dispar and the genetic features underlying Lepidoptera in general.

Keywords: European gypsy moth; Illumina short-read polishing; Lepidoptera; PacBio long-read assembly; automated gene finding; gypsy moth genomics; whole-genome sequencing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
L. dispar dispar genome assembly and annotation pipeline. The automated pipeline used to assemble the European gypsy moth genome, as well as annotate repetitive content and nuclear protein coding genes, is a modified version of the methods used for annotating the Asian gypsy moth genomes, L. dispar asiatica and L. dispar japonica (Hebert et al. 2019). (The Bombyx mori image is copyright Freepik 2018 and reproduced here with permission.)

References

    1. Bao Z, Eddy SR.. 2002. Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res. 12:1269–1276. - PMC - PubMed
    1. Buchfink B, Xie C, Huson DH.. 2015. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 12:59–60. - PubMed
    1. Djoumad A, Nisole A, Zahiri R, Freschi L, Picq S, et al. 2017. Comparative analysis of mitochondrial genomes of geographic variants of the gypsy moth, Lymantria dispar, reveals a previously undescribed genotypic entity. Sci Rep. 7:14245. - PMC - PubMed
    1. Ellinghaus D, Kurtz S, Willhoeft U.. 2008. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics. 9:18. - PMC - PubMed
    1. Fu L, Niu B, Zhu Z, Wu S, Li W.. 2012. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 28:3150–3152. - PMC - PubMed

Publication types

LinkOut - more resources