Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan;20(1):14-28.
doi: 10.1111/1755-0998.13091. Epub 2019 Dec 13.

Metatranscriptomics yields new genomic resources and sensitive detection of infections for diverse blood parasites

Affiliations

Metatranscriptomics yields new genomic resources and sensitive detection of infections for diverse blood parasites

Spencer C Galen et al. Mol Ecol Resour. 2020 Jan.

Abstract

Metatranscriptomics is a powerful method for studying the composition and function of complex microbial communities. The application of metatranscriptomics to multispecies parasite infections is of particular interest, as research on parasite evolution and diversification has been hampered by technical challenges to genome-scale DNA sequencing. In particular, blood parasites of vertebrates are abundant and diverse although they often occur at low infection intensities and exist as multispecies infections, rendering the isolation of genomic sequence data challenging. Here, we use birds and their diverse haemosporidian parasites to illustrate the potential for metatranscriptome sequencing to generate large quantities of genome-wide sequence data from multiple blood parasite species simultaneously. We used RNA-sequencing of 24 blood samples from songbirds in North America to show that metatranscriptomes can yield large proportions of haemosporidian protein-coding gene repertoires even when infections are of low intensity (<0.1% red blood cells infected) and consist of multiple parasite taxa. By bioinformatically separating host and parasite transcripts and assigning them to the haemosporidian genus of origin, we found that transcriptomes detected ~23% more total parasite infections across all samples than were identified using microscopy and DNA barcoding. For single-species infections, we obtained data for >1,300 loci from samples with as low as 0.03% parasitaemia, with the number of loci increasing with infection intensity. In total, we provide data for 1,502 single-copy orthologous loci from a phylogenetically diverse set of 33 haemosporidian mitochondrial lineages. The metatranscriptomic approach described here has the potential to accelerate ecological and evolutionary research on haemosporidians and other diverse parasites.

Keywords: Leucocytozoon; Parahaemoproteus; Plasmodium; RNA-seq; co-infection; malaria parasite.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Genus assignment based on a quartet resampling strategy. (A) For each transcriptome contig of unknown origin (green), one reference sequence from each genus (Plasmodium: orange, Parahaemoproteus: purple, Leucocytozoon: blue) is drawn from the same orthogroup alignment. (B) Sequence quartets with less than 150 bp of overlap among all four sequences are discarded. (C) If the contig has best hits (arrows) against different reference sequences in the amino acid and the nucleotide alignments, the quartet is considered ambiguous and no genus assignment is made. (D) If the nucleotide and the amino acid data result in a best hit against the same reference sequence and if at least one best hit is bi-directional (double headed arrow), the quartet is counted as positive for that genus. This procedure is executed for all possible quartet combinations within an orthogroup. Contigs with unambiguous hits against the same genus in the majority of quartets are assigned to that genus. Genus assignment was only conducted using orthogroup alignments with a minimum of two sequences for each reference genus.
Figure 2.
Figure 2.
The phylogenetic diversity of haemosporidian parasites detected in host samples selected for transcriptome sequencing. Shown are cytb phylogenies for the genera (A) Leucocytozoon, (B) Parahaemoproteus, and (C) Plasmodium, generated from all cytb haplotypes available on the avian haemosporidian database MalAvi as of April 2019 (Bensch et al. 2009). Depicted on each phylogeny are the names of the cytb lineages (determined by cytb barcoding) that were included for metatranscriptome sequencing or were used as references. Names in red indicate lineages that were used as references for the genus assignment procedure that were obtained from previously published research, and names in blue indicate lineages that were included in the reference database and were sequenced for this study. Names in black represent lineages that were sequenced in this study and were found in mixed-genus samples, and so were used as input for genus assignment.
Figure 3.
Figure 3.
Samples that contained different parasite genera and infection intensities differed in the composition of the orthogroups for which data was generated. A) Multiple correspondence analysis (MCA) showing that samples with Parahaemoproteus infections of >0.5% parasitemia cluster together to the exclusion of samples with low-intensity (<0.5%) Parahaemoproteus or Leucocytozoon infections. B) Venn diagrams demonstrating overlap in the number of orthogroups for which data was generated for sample groups defined by parasite genus and infection intensity.
Figure 4.
Figure 4.
Relationship between parasitemia and number of haemosporidian transcripts. Samples that generated a higher number of haemosporidian transcripts tended to have more intense infections, contain infections from a greater number of parasite species, and had higher sequencing depth. The number of haemosporidian transcripts shown here refers to the number of transcripts for each sample in the ‘all-orthologs’ dataset.

Similar articles

Cited by

References

    1. Alexander H, Jenkins BD, Rynearson TA, & Dyhrman ST (2015). Metatranscriptome analyses indicate resource partitioning between diatoms in the field. Proceedings of the National Academy of Sciences USA, 112(17), E2182–E2190. doi:10.1073/pnas.1421993112 - DOI - PMC - PubMed
    1. Andrews S (2010). FastQC: a quality control tool for high throughput sequence data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
    1. Asghar M, Hasselquist D, Hansson B, Zehtindjiev P, Westerdahl H, & Bensch S (2015). Hidden costs of infection: chronic malaria accelerates telomere degradation and senescence in wild birds. Science, 347(6220), 436–438. doi:10.1126/science.1261121 - DOI - PubMed
    1. Aurrecoechea C, Barreto A, Basenko EY, Brestelli J, Brunk BP, Cade S, … Zheng J (2016). EuPathDB: the eukaryotic pathogen genomics database resource. Nucleic Acids Research, 45(D1), D581–D591. doi:10.1093/nar/gkw1105 - DOI - PMC - PubMed
    1. Aurrecoechea C, Brestelli J, Brunk BP, Dommer J, Fischer S, Gajria B, … Wang H (2008). PlasmoDB: a functional genomic database for malaria parasites. Nucleic Acids Research, 37(suppl_1), D539–D543. doi:10.1093/nar/gkn814 - DOI - PMC - PubMed

Publication types