Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Case Reports
. 2023 Apr 4;14(4):866.
doi: 10.3390/genes14040866.

Leishmania infantum (JPCM5) Transcriptome, Gene Models and Resources for an Active Curation of Gene Annotations

Affiliations
Case Reports

Leishmania infantum (JPCM5) Transcriptome, Gene Models and Resources for an Active Curation of Gene Annotations

Esther Camacho et al. Genes (Basel). .

Abstract

Leishmania infantum is one of the causative agents of visceral leishmaniases, the most severe form of leishmaniasis. An improved assembly for the L. infantum genome was published five years ago, yet delineation of its transcriptome remained to be accomplished. In this work, the transcriptome annotation was attained by a combination of both short and long RNA-seq reads. The good agreement between the results derived from both methodologies confirmed that transcript assembly based on Illumina RNA-seq and further delimitation according to the positions of spliced leader (SAS) and poly-A (PAS) addition sites is an adequate strategy to annotate the transcriptomes of Leishmania, a procedure previously used for transcriptome annotation in other Leishmania species and related trypanosomatids. These analyses also confirmed that the Leishmania transcripts boundaries are relatively slippery, showing extensive heterogeneity at the 5'- and 3'-ends. However, the use of RNA-seq reads derived from the PacBio technology (referred to as Iso-Seq) allowed the authors to uncover some complex transcription patterns occurring at particular loci that would be unnoticed by the use of short RNA-seq reads alone. Thus, Iso-Seq analysis provided evidence that transcript processing at particular loci would be more dynamic than expected. Another noticeable finding was the observation of a case of allelic heterozygosity based on the existence of chimeric Iso-Seq reads that might be generated by an event of intrachromosomal recombination. In addition, we are providing the L. infantum gene models, including both UTRs and CDS regions, that would be helpful for undertaking whole-genome expression studies. Moreover, we have built the foundations of a communal database for the active curation of both gene/transcript models and functional annotations for genes and proteins.

Keywords: Leishmania; Mendeley data; Wikidata; gene expression; gene models; transcriptome.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Complex transcription patterns were observed at particular L. infantum loci. (A) According to Illumina RNA-seq reads coverage and position of SL-addition sites (black vertical arrows), a single transcript was deduced to be expressed from genes LINF_130006600 and LINF_130006600 (horizontal arrow shows the transcriptional direction). The existence of a complete Iso-Seq transcript (molecule shaded in red) derived from gene LINF_130006600 indicated that this gene is expressed into a mature individual RNA. However, no evidence of an individual mRNA for gene LINF_130006700 was found, suggesting that this gene may be expressed as a bicistronic transcript expanding both genes (transcript LINF_130006700-600-T). (B) Illumina RNA-seq reads coverage on gene LINF_240027300 suggested the existence of a sole mRNA molecule for this gene. However, the presence of two complete Iso-seq molecules indicated that two stable mRNAs are produced (LINF_240027300-T2 and LINF_240027300-T3) together with a third molecule expanding the complete gene (LINF_240027300-T1).
Figure 2
Figure 2
Allelic heterogeneity on a genomic region of L. infantum chromosome 32. (A) Two types of transcripts (Iso-Seq molecules) were identified: one derived from gene LINF_320009800 and the other showing a chimeric nature (derived from separated genomic regions). (B) The existence of a lower number of PacBio DNA-seq reads in the region-expanding genes LINF_320009700, and LINF_320009800 suggested an intrachromosomal deletion event affecting one allele. The hypothesized structure for both alleles is depicted together with the PacBio DNA-seq reads coverage on both regions. The existence of a direct repeat of 695 nt in length is denoted by black boxes. (C) Experimental testing by PCR of the deletion event. Two pairs of oligonucleotides (F-R and F-R’) were used to amplify the 3′-UTR of gene LINF_320009700 and the whole genomic region (genes LINF_320009700, LINF_320009800 and LINF_320009900), respectively. The PCR products were separated on a 1% agarose gel (panel C1). A band of around 8.7 kb was also observed after PCR amplification with oligonucleotides F and R’ (panel C2). At the boundaries of gels are shown the sizes (in nucleotides) of the HindIII-digested Ø29 phage DNA bands used as a molecular marker (M). Lanes 3 (gel C1) and 2 (gel C2) are controls in which PCR was performed in the absence of a DNA template.
Figure 3
Figure 3
Steps for creating gene models. (A) Transcripts are assembled from Illumina RNA-seq reads. (B) SL-containing RNA-seq reads are used to map the SL-addition sites (SASs); the main SAS (vertical arrow) is assigned to that supported by the largest number of SL-containing RNA-seq reads, and the rest as categorized as alternatives (see Supplementary File S1 for complete lists of main and alternative SASs). (C) Transcript is trimmed according to the position of the main SAS. (D) Coding sequences (CDS) are automatically annotated by the Companion tool. (E) If CDS surpasses transcript 5′-end, the CDS is manually shortened at the 5′-end. (F) After embedding the CDS into the transcript sequence, the gene model emerges with their 5′ and 3′ untranslated regions (UTRs).

References

    1. Alvar J., Velez I.D., Bern C., Herrero M., Desjeux P., Cano J., Jannin J., den Boer M. Leishmaniasis worldwide and global estimates of its incidence. PLoS ONE. 2012;7:e35671. doi: 10.1371/journal.pone.0035671. - DOI - PMC - PubMed
    1. Volpedo G., Huston R.H., Holcomb E.A., Pacheco-Fernandez T., Gannavaram S., Bhattacharya P., Nakhasi H.L., Satoskar A.R. From infection to vaccination: Reviewing the global burden, history of vaccine development, and recurring challenges in global leishmaniasis protection. Expert Rev. Vaccines. 2021;20:1431–1446. doi: 10.1080/14760584.2021.1969231. - DOI - PubMed
    1. Peacock C.S., Seeger K., Harris D., Murphy L., Ruiz J.C., Quail M.A., Peters N., Adlem E., Tivey A., Aslett M., et al. Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat. Genet. 2007;39:839–847. doi: 10.1038/ng2053. - DOI - PMC - PubMed
    1. Rochette A., Raymond F., Corbeil J., Ouellette M., Papadopoulou B. Whole-genome comparative RNA expression profiling of axenic and intracellular amastigote forms of Leishmania infantum. Mol. Biochem. Parasitol. 2009;165:32–47. doi: 10.1016/j.molbiopara.2008.12.012. - DOI - PubMed
    1. Rosenzweig D., Smith D., Opperdoes F., Stern S., Olafson R.W., Zilberstein D. Retooling Leishmania metabolism: From sand fly gut to human macrophage. Faseb J. 2008;22:590–602. doi: 10.1096/fj.07-9254com. - DOI - PubMed

Publication types

LinkOut - more resources