Sequencing, assembly and annotation of the whole-insect genome of Lymantria dispar dispar, the European gypsy moth
- PMID: 33930134
- PMCID: PMC8496242
- DOI: 10.1093/g3journal/jkab150
Sequencing, assembly and annotation of the whole-insect genome of Lymantria dispar dispar, the European gypsy moth
Abstract
The European gypsy moth, Lymantria dispar dispar (LDD), is an invasive insect and a threat to urban trees, forests and forest-related industries in North America. For use as a comparator with a previously published genome based on the LD652 pupal ovary-derived cell line, as well as whole-insect genome sequences obtained from the Asian gypsy moth subspecies L. dispar asiatica and L. dispar japonica, the whole-insect LDD genome was sequenced, assembled and annotated. The resulting assembly was 998 Mb in size, with a contig N50 of 662 Kb and a GC content of 38.8%. Long interspersed nuclear elements constitute 25.4% of the whole-insect genome, and a total of 11,901 genes predicted by automated gene finding encoded proteins exhibiting homology with reference sequences in the NCBI NR and/or UniProtKB databases at the most stringent similarity cutoff level (i.e., the gold tier). These results will be especially useful in developing a better understanding of the biology and population genetics of L. dispar and the genetic features underlying Lepidoptera in general.
Keywords: European gypsy moth; Illumina short-read polishing; Lepidoptera; PacBio long-read assembly; automated gene finding; gypsy moth genomics; whole-genome sequencing.
Published by Oxford University Press on behalf of Genetics Society of America 2021.
Figures
References
-
- Buchfink B, Xie C, Huson DH.. 2015. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 12:59–60. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous