Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2018 Nov 8;8(1):16549.
doi: 10.1038/s41598-018-34812-x.

A complete Leishmania donovani reference genome identifies novel genetic variations associated with virulence

Affiliations
Comparative Study

A complete Leishmania donovani reference genome identifies novel genetic variations associated with virulence

Patrick Lypaczewski et al. Sci Rep. .

Abstract

Leishmania donovani is responsible for visceral leishmaniasis, a neglected and lethal parasitic disease with limited treatment options and no vaccine. The study of L. donovani has been hindered by the lack of a high-quality reference genome and this can impact experimental outcomes including the identification of virulence genes, drug targets and vaccine development. We therefore generated a complete genome assembly by deep sequencing using a combination of second generation (Illumina) and third generation (PacBio) sequencing technologies. Compared to the current L. donovani assembly, the genome assembly reported within resulted in the closure over 2,000 gaps, the extension of several chromosomes up to telomeric repeats and the re-annotation of close to 15% of protein coding genes and the annotation of hundreds of non-coding RNA genes. It was possible to correctly assemble the highly repetitive A2 and Amastin virulence gene clusters. A comparative sequence analysis using the improved reference genome confirmed 70 published and identified 15 novel genomic differences between closely related visceral and atypical cutaneous disease-causing L. donovani strains providing a more complete map of genes associated with virulence and visceral organ tropism. Bioinformatic tools including protein variation effect analyzer and basic local alignment search tool were used to prioritize a list of potential virulence genes based on mutation severity, gene conservation and function. This complete genome assembly and novel information on virulence factors will support the identification of new drug targets and the development of a vaccine for L. donovani.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Location of the gaps along 36 chromosomes that have been closed in this new assembly. Chromosomal locations of gaps are indicated in red. No gaps remain in the current assembly.
Figure 2
Figure 2
Organization of the 4 copies of the A2 gene on chromosome 22 in the attenuated cutaneous L. donovani strain. (a) Locations of the 4 A2 genes are shown in blue and numbered 1–4. Interspaced A2-rel genes are labeled in orange, 3′ A2-rel genes are labeled in green and 5′ A2-rel genes are labeled in yellow. A2-rel genes have no homology with A2 genes. Transcription direction is shown according to strandedness: blue represents reverse strand direction of transcription, red represents forward strand transcription. The genes located in the 63 kb region between opposing A2 clusters are not depicted for clarity. (b) Alignment of the longest (~11 kb+) PacBio reads to the A2 clusters. Reads in the 5′ to 3′ direction labeled in red; reads in the 3′ to 5′ direction labeled in blue. (c) Western blot analysis of A2 proteins in the attenuated cutaneous L. donovani strain. The sizes of the A2 proteins are consistent with the ORFs and number of A2 genes identified in this assembly. (d) Coverage graph of chromosome 22 using Illumina (blue) and PacBio (orange) reads.
Figure 3
Figure 3
L. donovani maintains high levels of synteny with L. major including chromosome 22 where the A2 genes are located. Left: Dot plot of the coding DNA sequences of L. major compared to those of L. donovani generated from our assembly across the entire genome. Right: Synteny comparison of chromosome 22. The outer most circle represents the chromosomal location. The second circle is labelled with genes on the forward strand (blue) and genes on the reverse strand (red). The third circle represents genes that are only present in one of the two compared species. The inner association lines join syntenic genes between the two species.
Figure 4
Figure 4
The new L. donovani genome assembly results in a significant change in gene annotations. (a) New or improved gene annotations are highlighted in Blue along the 36 chromosomes. Compared to the previous L. donovani reference assembly (ASM22713v2 from strain BPK282A1), there were 1,087 protein coding genes unannotated or differently annotated in the current assembly. Unannotated or differently annotated genes were obtained by removing all annotations generated from our assembly that shared 95% or greater similarity to those previously available. (b) Expansion of the amastin gene cluster on chromosome 8. Top track contains the previously two known coding sequences aligned to the previous L. donovani reference assembly (ASM22713v2 from strain BPK282A1). Gaps in the previous assembly depicted as dotted lines. Bottom track contains 10 amastin genes identified in the updated assembly. One previously identified Amastin gene has been aligned, 1 has been expanded and 8 have been annotated de novo.
Figure 5
Figure 5
Verification of previously identified SNPs and location of new SNPs that differ between the virulent VL and attenuated CL strains of L. donovani. Chromosomal location of previously identified homozygous non-synonymous SNPs between the cutaneous and visceral disease derived L. donovani strains (Red) compared to the novel SNPs identified only in this study (Blue) (synonymous and heterozygous codon changes identified are not labeled). Note that all the previously identified SNPs were also identified, or confirmed, in this study. 70 SNPs were previously identified across 66 genes. The same 70 SNPs were identified in this study, with an additional 15 novel SNPs not previously seen specific to the cutaneous strain. Genomic locations of SNPs identified in the previous study were translated to new genomic coordinates based on the new assembly for consistency. Arrows in yellow highlight the position of the previously identified RagC SNP on chromosome 36 and the A2 copy number difference on chromosome 22.
Figure 6
Figure 6
Summary of all genes with non-synonymous mutations between the cutaneous, visceral, and gain-of-function strains of L. donovani. All non-synonymous SNPs and Indels were classified as common to our previous study (2014 CL) or identified in this study (Novel), as well as by their effect on amino acid changes from top to bottom, colored red to green in descending order of likelihood to affect the phenotype of the parasite. 66 genes were common to the previous data set. Of those genes, 7 were previously investigated and 1 was rejected due to an open reading frame misannotation. 25 genes were only listed in this study (Novel). Diagram created using SankeyMATIC (http://sankeymatic.com).

Similar articles

Cited by

References

    1. Alvar J, et al. Leishmaniasis worldwide and global estimates of its incidence. PLoS One. 2012;7:e35671. doi: 10.1371/journal.pone.0035671. - DOI - PMC - PubMed
    1. Peacock CS, et al. Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat. Genet. 2007;39:839–847. doi: 10.1038/ng2053. - DOI - PMC - PubMed
    1. Zhang WW, Matlashewski G. Screening Leishmania donovani-specific genes required for visceral infection. Mol. Microbiol. 2010;77:505–517. doi: 10.1111/j.1365-2958.2010.07230.x. - DOI - PubMed
    1. Zhang WW, et al. Genetic Analysis of Leishmania donovani Tropism Using a Naturally Attenuated Cutaneous Strain. PLoS Pathog. 2014;10:e1004244. doi: 10.1371/journal.ppat.1004244. - DOI - PMC - PubMed
    1. Goodwin S, McPherson JD, McCombie WR. Coming of age: Ten years of next-generation sequencing technologies. Nat. Rev. Genet. 2016;17:333–351. doi: 10.1038/nrg.2016.49. - DOI - PMC - PubMed

Publication types

Substances

Grants and funding