Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Dec 22;7(1):18050.
doi: 10.1038/s41598-017-18374-y.

Resequencing of the Leishmania infantum (strain JPCM5) genome and de novo assembly into 36 contigs

Affiliations

Resequencing of the Leishmania infantum (strain JPCM5) genome and de novo assembly into 36 contigs

Sandra González-de la Fuente et al. Sci Rep. .

Abstract

Leishmania parasites are the causative of leishmaniasis, a group of potentially fatal human diseases. Control strategies for leishmaniasis can be enhanced by genome based investigations. The publication in 2005 of the Leishmania major genome sequence, and two years later the genomes for the species Leishmania braziliensis and Leishmania infantum were major milestones. Since then, the L. infantum genome, although highly fragmented and incomplete, has been used widely as the reference genome to address whole transcriptomics and proteomics studies. Here, we report the sequencing of the L. infantum genome by two NGS methodologies and, as a result, the complete genome assembly on 36 contigs (chromosomes). Regarding the present L. infantum genome-draft, 495 new genes have been annotated, a hundred have been corrected and 75 previous annotated genes have been discontinued. These changes are not only the result of an increase in the genome size, but a significant contribution derives from the existence of a large number of incorrectly assembled regions in current chromosomal scaffolds. Furthermore, an improved assembly of tandemly repeated genes has been obtained. All these analyses support that the de novo assembled L. infantum genome represents a robust assembly and should replace the currently available in the databases.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1
Schematic overview of the workflow leading to the L. infantum genome assembly. Input files (Raw Reads) are represented as yellow rhomboids. All the different software and processes are shown in blue boxes. Output files are represented in green rhomboids. Discarded data are shown in red rhomboids. See Materials and Methods section for additional details.
Figure 2
Figure 2
Read-depth analysis along the chromosomes formed by the fusion of two PacBio-assembled contigs. Coverage was determined by sliding window analysis (bin 200 pb) with either Illumina (in blue) or PacBio (in red) reads, along chromosomes 7, 12, 15, 22, 26, 28, 33 and 35. The sizes of the contigs are shown by lines with arrow-heads. Chromosomes 7 (panel A) and 35 (panel H) were joined by the SSPACE-standard tool. Chromosomes 12, 15, 22, 26 and 28 (panels B–F) were joined using the minimus 2 assembler. Finally, chromosome 33 (panel G) was joined by the SSPACE-LongRead tool.
Figure 3
Figure 3
Gene copy number in the beta-tubulin locus at the chromosome 33. Panel (A): Genomic structure of the region containing the beta-tubulin locus in the L. major Friedlin genome (LmjF), in the L. infantum genome assembled in this work (LinJ-New) and in the current L. infantum assembly (LinJ-Ref).The identity percentage of the BLAST alignment (using the tab –format output of BLAST) is shown by shading with brown hue (scale at top left ranges from 90 to 100% of sequence identity). (B) Distribution of L. major Illumina reads (unpublished laboratory data) along the beta-tubulin locus in the L. major current genome (GeneDB.org). (C) Distribution of L. infantum sequence-reads (Illumina in blue and PacBio in red) along the beta-tubulin genomic region using as reference the L. infantum genome assembled in this work. (D) Distribution of L. infantum reads (Illumina in blue and PacBio in red) along the region containing the beta-tubulin locus in current L. infantum genome (version 9; Tritryp.org).
Figure 4
Figure 4
Gene copy number in the glucose transporter 2 locus. (A) Genomic structure of the region containing the glucose transporter 2 locus in the L. major Friedlin genome (LmjF), in the L. infantum genome assembled in this work (LinJ-New) and in the current L. infantum assembly (LinJ-Ref). See legend to Fig. 3 for the meaning of color codes. (B) Distribution of L. major Illumina reads along the glucose transporter 2 region in the L. major current genome (GeneDB). (C) Distribution of L. infantum sequence-reads (Illumina in blue and PacBio in red) along the glucose transporter 2 genomic region using as reference the L. infantum genome assembled in this work. (D) Distribution of L. infantum reads (Illumina in blue and PacBio in red) along the region containing the glucose transporter 2 locus in current L. infantum genome (version 9; Tritryp.org).
Figure 5
Figure 5
Schematic illustration of mis-assembled regions in the current L. infantum genome. Synteny blocks, represented by different colors, in chromosome 7 (panel A) and chromosome 13 (panel B) after pair-wise comparisons between the L. major Friedlin genome (top), the L. infantum newly assembled genome (middle) and L. infantum reference genome (bottom). Pairwise alignments were generated by the progressive MAUVE algorithm, which uses color codes to depict blocks of conserved regions. Sections located underneath the x-axis show inversion events.

References

    1. Moreira D, Lopez-Garcia P, Vickerman K. An updated view of kinetoplastid phylogeny using environmental sequences and a closer outgroup: proposal for a new classification of the class Kinetoplastea. Int J Syst Evol Microbiol. 2004;54:1861–1875. doi: 10.1099/ijs.0.63081-0. - DOI - PubMed
    1. Akhoundi M, et al. A Historical Overview of the Classification, Evolution, and Dispersion of Leishmania Parasites and Sandflies. PLoS neglected tropical diseases. 2016;10:e0004349. doi: 10.1371/journal.pntd.0004349. - DOI - PMC - PubMed
    1. Murray HW, Berman JD, Davies CR, Saravia NG. Advances in leishmaniasis. Lancet. 2005;366:1561–1577. doi: 10.1016/S0140-6736(05)67629-5. - DOI - PubMed
    1. Fraga J, et al. Evolution and species discrimination according to the Leishmania heat-shock protein 20 gene. Infect Genet Evol. 2013;18:229–237. doi: 10.1016/j.meegid.2013.05.020. - DOI - PubMed
    1. Lukes J, et al. Evolutionary and geographical history of the Leishmania donovani complex with a revision of current taxonomy. Proc Natl Acad Sci USA. 2007;104:9375–9380. doi: 10.1073/pnas.0703678104. - DOI - PMC - PubMed

Publication types

MeSH terms