Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Sep:76:118-147.
doi: 10.1016/j.ibmb.2016.07.005. Epub 2016 Aug 12.

Multifaceted biological insights from a draft genome sequence of the tobacco hornworm moth, Manduca sexta

Michael R Kanost  1 Estela L Arrese  2 Xiaolong Cao  3 Yun-Ru Chen  4 Sanjay Chellapilla  5 Marian R Goldsmith  6 Ewald Grosse-Wilde  7 David G Heckel  8 Nicolae Herndon  5 Haobo Jiang  3 Alexie Papanicolaou  9 Jiaxin Qu  10 Jose L Soulages  2 Heiko Vogel  8 James Walters  11 Robert M Waterhouse  12 Seung-Joon Ahn  8 Francisca C Almeida  13 Chunju An  14 Peshtewani Aqrawi  10 Anne Bretschneider  8 William B Bryant  15 Sascha Bucks  7 Hsu Chao  10 Germain Chevignon  16 Jayne M Christen  17 David F Clarke  18 Neal T Dittmer  17 Laura C F Ferguson  19 Spyridoula Garavelou  20 Karl H J Gordon  21 Ramesh T Gunaratna  3 Yi Han  10 Frank Hauser  22 Yan He  3 Hanna Heidel-Fischer  8 Ariana Hirsh  23 Yingxia Hu  3 Hongbo Jiang  24 Divya Kalra  10 Christian Klinner  7 Christopher König  7 Christie Kovar  10 Ashley R Kroll  25 Suyog S Kuwar  8 Sandy L Lee  10 Rüdiger Lehman  26 Kai Li  27 Zhaofei Li  28 Hanquan Liang  29 Shanna Lovelace  30 Zhiqiang Lu  28 Jennifer H Mansfield  23 Kyle J McCulloch  31 Tittu Mathew  10 Brian Morton  23 Donna M Muzny  10 David Neunemann  8 Fiona Ongeri  10 Yannick Pauchet  8 Ling-Ling Pu  10 Ioannis Pyrousis  20 Xiang-Jun Rao  32 Amanda Redding  33 Charles Roesel  34 Alejandro Sanchez-Gracia  13 Sarah Schaack  25 Aditi Shukla  23 Guillaume Tetreau  35 Yang Wang  3 Guang-Hua Xiong  36 Walther Traut  37 Tom K Walsh  18 Kim C Worley  10 Di Wu  17 Wenbi Wu  15 Yuan-Qing Wu  10 Xiufeng Zhang  3 Zhen Zou  36 Hannah Zucker  38 Adriana D Briscoe  31 Thorsten Burmester  39 Rollie J Clem  15 René Feyereisen  40 Cornelis J P Grimmelikhuijzen  22 Stavros J Hamodrakas  41 Bill S Hansson  7 Elisabeth Huguet  16 Lars S Jermiin  18 Que Lan  42 Herman K Lehman  43 Marce Lorenzen  44 Hans Merzendorfer  45 Ioannis Michalopoulos  20 David B Morton  46 Subbaratnam Muthukrishnan  17 John G Oakeshott  18 Will Palmer  47 Yoonseong Park  48 A Lorena Passarelli  15 Julio Rozas  13 Lawrence M Schwartz  49 Wendy Smith  50 Agnes Southgate  51 Andreas Vilcinskas  52 Richard Vogt  53 Ping Wang  35 John Werren  33 Xiao-Qiang Yu  54 Jing-Jiang Zhou  55 Susan J Brown  5 Steven E Scherer  10 Stephen Richards  10 Gary W Blissard  4
Affiliations

Multifaceted biological insights from a draft genome sequence of the tobacco hornworm moth, Manduca sexta

Michael R Kanost et al. Insect Biochem Mol Biol. 2016 Sep.

Abstract

Manduca sexta, known as the tobacco hornworm or Carolina sphinx moth, is a lepidopteran insect that is used extensively as a model system for research in insect biochemistry, physiology, neurobiology, development, and immunity. One important benefit of this species as an experimental model is its extremely large size, reaching more than 10 g in the larval stage. M. sexta larvae feed on solanaceous plants and thus must tolerate a substantial challenge from plant allelochemicals, including nicotine. We report the sequence and annotation of the M. sexta genome, and a survey of gene expression in various tissues and developmental stages. The Msex_1.0 genome assembly resulted in a total genome size of 419.4 Mbp. Repetitive sequences accounted for 25.8% of the assembled genome. The official gene set is comprised of 15,451 protein-coding genes, of which 2498 were manually curated. Extensive RNA-seq data from many tissues and developmental stages were used to improve gene models and for insights into gene expression patterns. Genome wide synteny analysis indicated a high level of macrosynteny in the Lepidoptera. Annotation and analyses were carried out for gene families involved in a wide spectrum of biological processes, including apoptosis, vacuole sorting, growth and development, structures of exoskeleton, egg shells, and muscle, vision, chemosensation, ion channels, signal transduction, neuropeptide signaling, neurotransmitter synthesis and transport, nicotine tolerance, lipid metabolism, and immunity. This genome sequence, annotation, and analysis provide an important new resource from a well-studied model insect species and will facilitate further biochemical and mechanistic experimental studies of many biological systems in insects.

Keywords: Innate immunity; Insect; Insect biochemistry; Lepidoptera; Moth; Synteny; Tobacco hornworm.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The Manduca sexta gene repertoire and molecular species phylogeny. Approximately half of the 15,451 M. sexta genes have identifiable orthologs in the representative genomes of mammals, human and mouse (pie chart, blue), suggesting that these are ancient genes likely to have been present in the metazoan ancestor. A further 32% of M. sexta genes exhibit orthology to genes from the other seven representative insect species (green, 22% or 3427 genes), or only to genes from the other five lepidopteran species (pink, 10% or 1588 genes). Of the remaining genes, some have orthologs (orange) or homologs (red) in other animal species (other metazoan species from OrthoDB), leaving 417 M. sexta genes (app. 3%) without any recognizable homologs (yellow, e-value cutoff 1e-3). Employing aligned protein sequences of universal single-copy orthologs to estimate the molecular species phylogeny rooted with the starlet sea anemone, Nematostella vectensis, shows that the Lepidoptera and Diptera exhibit the fastest rates of molecular divergence. All nodes have 100% bootstrap support. The boxplots show the distributions of percent amino acid identities between M. sexta proteins and their best-reciprocal-hits from mammal species (MAM; median 34.9%), insect species (INS; median 40.1%), and lepidopteran species (LEP; median 60.2%). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2
Fig. 2
Remarkably conserved synteny among the Lepidoptera. A. Predicted contiguous ancestral regions (CARs) for common ancestors at the major nodes of the insect species phylogeny. Pie charts show distributions of CAR lengths (number of orthologous anchor genes) with diameters proportional to the total number of anchor genes in the CARs. The molecular species phylogeny was built from single-copy orthologs from four representative species from four major groups – Lepidoptera, Drosophila, Culicidae, and Hymenoptera – and rooted with the body louse, Pediculus humanus. B. Examining the fates of the 1329 Holometabola CAR gene neighbor pairs in the sixteen extant species classifies them as kept (dark green, maintained neighbors), likely kept (light green, inferred maintained neighbors), lost (red, no longer neighbors), or ambiguous (orange, missing orthologs). C. Evolutionary distances between species pairs, in terms of branch lengths from the phylogeny in panel A, are plotted against the percentage of orthologous gene anchors maintained in synteny (top) and the syntenic pair to gene ratio (bottom, number of neighboring gene pairs/number of genes maintained in synteny), with linear regressions of all species pairs (solid lines) and all non-lepidopteran species pairs (dashed lines). s.s., substitutions per site; HYM, Hymenoptera; LEP, Lepidoptera; DRO, Drosophila; CUL, Culicidae; Aaeg, Aedes aegypti; Aatr, Anopheles atroparvus; Agam, Anopheles gambiae; Amel, Apis mellifera; Bmor, Bombyx mori; Cqui, Culex quinquefasciatus; Dmel, Drosophila melanogaster; Dmoj, Drosophila mojavensis; Dple, Danaus plexippus; Dpse, Drosophila pseudoobscura; Dvir, Drosophila virilis; Hmel, Heliconius melpomene; Hsal, Harpegnathos saltator; Msex, Manduca sexta; Nvit, Nasonia vitripennis; Pbar, Pogonomyrmex barbatus. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3
Fig. 3
Overview of gene transcripts and their relative levels in the 52 cDNA libraries. (a) Distribution of genes and their transcripts based on splicing variants per gene; (b) Percentages of OGS2.0 genes (left) and sums of their FPKM values (right) in the five FPKM categories. The 52 libraries are in the same order as described in He et al. (2015).
Fig. 4
Fig. 4
Gene expression of 68 gut serine proteases and their close homologs in various tissue samples. (a) The mRNA levels, as represented by log2(FPKM +1) values, are shown in the gradient heat map from blue (0) to red (≥10) (Cao et al., 2015c); (b) Average FPKM values in whole body, Malpighian tubules (MT), and other tissues; (c) stage-dependent transcription in midgut tissues from 2nd instar larvae (L2), late 3rd instar (L3L), early (L4E) and late (L4L) 4th instar, 1–3 h, day 1, pre-wandering (PW) and wandering (W) 5th instar larvae, day 1 and days 15–18 pupae, and days 3–5 adults. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 5
Fig. 5
Manduca Hox gene cluster. (A) A phylogenetic tree was constructed using translations of the Hox and Shx gene homeodomains from Manduca sexta (Ms), Heliconius melpomene (Hm), Bombyx mori (Bm) and Danaus plexippus (Dp). Homeodomains were extracted from genomic annotations and aligned using ClustalW. A maximum likelihood tree was generated with 100 bootstrap replicates using PhyML with a LG + G model with parameters sampled from the data. Manduca orthologs are highlighted. (B) A single Manduca scaffold contained the majority of the Hox cluster, including four Shx genes. The orientation of ShxD is reversed relative to the other Lepidoptera. An Apis mellifera (bee) Hox cluster is displayed as a representative ancestral insect cluster.
Fig. 6
Fig. 6
Summary of the families of genes coding for chitin metabolism enzymes and chitin binding proteins (CBPs) in the M. sexta genome. Dotted lines indicate the binding of CBPs to chitin by their chitin binding domains (CBD – orange), and lines with an arrow indicate that chitin metabolism enzymes and their functions in chitin synthesis (chitin synthases, CHS – green), deacetylation (chitin deacetylases, CDA – blue) and degradation (chitinases, CHT – red). For CDA and CHT, a phylogenetic tree has been constructed using the neighbor-joining method, with 1000 replications of bootstrap analyses, implemented in MEGA 6.06 (Tamura et al., 2011). CPAP: cuticular proteins analogous to peritrophin; PMP: peritrophic matrix protein. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 7
Fig. 7
Location, structure, and phytogeny of the chorion gene cluster and chorion genes of M. sexta. A. Possible configurations of the chorion locus. Represented are the four different relative orientations between chorion gene containing scaffolds, scaffold00032 and scaffold00064. The two annotated contiguous chorion gene clusters and the non-chorion orthologs adjacent to the scaffold00064 cluster are shown. The reverse complementary strands of the scaffolds are represented in red. scaffold04803 may be adjacent to scaf-fold00032 or to scaffold00064. B. Phylogenetic tree of all chorion protein sequences, based on maximum likelihood. Each class is divided into two subclasses which are clustered together in the genome (early A in green, middle A in red, middle B in blue, and early B in magenta). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 8
Fig. 8
Pathways for the synthesis and secretion of diacylglycerol (DG) in fat body. Fatty acids (FA) entering the fat body, produced de novo, or released from triacylglycerol (TG) by the action of lipases (TGL, ATGL and HSL) on the lipid droplets are reused in part to form DG for export. The acyl-CoA formed by ACS could enter the synthesis of DG through the PA-pathway or through the MG-pathway, which would use monoacylglycerol (MG) produced by the hydrolysis of stored TG. Export of DG to circulating lipophorin is expected to involve the lipid transfer protein (LTP) and other membrane proteins, such as the lipophorin receptor. A similar scheme could be envisaged for the synthesis and export of DG from midgut tissue.
Fig. 9
Fig. 9
An overview of the immune system in M. sexta. Pathogens and their surface molecules (in blue font) are recognized by pattern recognition receptors in the plasma or on the immune cells. A serine protease/serine protease homolog system (shown as pink pacmans) is activated by sequential proteolytic cleavage to generate active phenoloxidases and Spätzle-1. Serpins (colored triangles) modulate melanization and cytokine effects by inhibiting immune SPs. The putative intracellular pathways (Toll, Imd, MAPK-JNK-p38, JAK-STAT) are activated by cytokines (e.g., Spätzle-1) and microbial compounds (e.g., DAP-PG) through receptors, adaptors, kinases (red spheres), and transcription factors (colored ovals), which transactivate the expression of immunity-related genes (e.g., AMPs). Newly synthesized proteins either replenish the defense molecules used up in the initial reaction or serve as effectors to kill the survived pathogens. Autophagy, apoptosis, and RNA interference are involved in insect antiviral responses. The stimulatory and inhibitory steps are depicted as red arrows and blue bars, respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 10
Fig. 10
Domain architecture, structural model, phylogenetic relationships, and expression profiles for some of the nondigestive serine proteases and serine protease homologs (SPs/ SPHs) in M. sexta. a) SP50, representing the 52 multidomain SPs/SPHs, has the domains organized in the same way as those in its ortholog Drosophila Nudel; PAP3, one of the 42 clipdomain SPs/SPHs, activates proPOs in the presence of SPH1 and SPH2. b) 3D model of the clip domain-1 in PAP3 is highly similar to the known structure of PAP2 clip domain-1 (Huang et al., 2007). α helix, red; β strand, yellow; coil, green; Cys, pink. c) Phylogenetic analysis of the entire clip-domain SP/SPH sequences in groups A (black, SPH, group-3 clip domain), B (red, SP, group-2 clip domain), C (green, SP, group-1a clip domain), and D (blue, SP, group-1b or-1c clip domain). d) PAP3 and SP50 mRNA levels in M. sexta tissues from various life stages. X-axis, RNA-seq library number; Y-axes, FPKM values of SP50 (black dotted line) and PAP3 (red solid line). PAP3 transcripts are abundant in fat body of wandering larvae and early pupae; SP50 mRNA levels are high in fat body and ovary of late pupae and adults. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Similar articles

Cited by

References

    1. Ahmad SA, Hopkins TL. Beta-glucosylation of plant phenolics by phenol beta-glucosyltransferase in larval tissues of the tobacco hornworm, Manduca sexta (L) Insect Bioch. Mol. Biol. 1993a;23:581–589.
    1. Ahmad SA, Hopkins TL. Phenol beta-glucosyltransferases in 6 species of insects – properties and tissue Localization. Comp. Biochem. Phys. B. 1993b;104:515–519.
    1. Ahmad SA, Hopkins TL, Kramer KJ. Tyrosine beta-glucosyltransferase in the tobacco hornworm, Manduca sexta (L): properties, tissue localization, and developmental profile. Insect Biochem. Mol. Biol. 1996;26:49–57.
    1. Ahn SJ, Vogel H, Heckel DG. Comparative analysis of the UDP-glycosyltransferase multigene family in insects. Insect Biochem. Mol. Biol. 2012;42:133–147. - PubMed
    1. Ahola V, Lehtonen R, Somervuo P, Salmela L, Koskinen P, Rastas P, Valimaki N, Paulin L, Kvist J, Wahlberg N, Tanskanen J, Hornett EA, Ferguson LC, Luo S, Cao Z, de Jong MA, Duplouy A, Smolander OP, Vogel H, McCoy RC, Qian K, Chong WS, Zhang Q, Ahmad E, Haukka JK, Joshi A, Salojarvi J, Wheat CW, Grosse-Wilde E, Hughes D, Katainen R, Pitkanen E, Ylinen J, Waterhouse RM, Turunen M, Vaharautio A, Ojanen SP, Schulman AH, Taipale M, Lawson D, Ukkonen E, Makinen V, Goldsmith MR, Holm L, Auvinen P, Frilander MJ, Hanski I. The Glanville fritillary genome retains an ancient karyotype and reveals selective chromosomal fusions in Lepidoptera. Nat. Commun. 2014;5:4737. - PMC - PubMed

Publication types