Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Aug;7(8):e1002135.
doi: 10.1371/journal.pcbi.1002135. Epub 2011 Aug 18.

Statistical inference for multi-pathogen systems

Affiliations

Statistical inference for multi-pathogen systems

Sourya Shrestha et al. PLoS Comput Biol. 2011 Aug.

Abstract

There is growing interest in understanding the nature and consequences of interactions among infectious agents. Pathogen interactions can be operational at different scales, either within a co-infected host or in host populations where they co-circulate, and can be either cooperative or competitive. The detection of interactions among pathogens has typically involved the study of synchrony in the oscillations of the protagonists, but as we show here, phase association provides an unreliable dynamical fingerprint for this task. We assess the capacity of a likelihood-based inference framework to accurately detect and quantify the presence and nature of pathogen interactions on the basis of realistic amounts and kinds of simulated data. We show that when epidemiological and demographic processes are well understood, noisy time series data can contain sufficient information to allow correct inference of interactions in multi-pathogen systems. The inference power is dependent on the strength and time-course of the underlying mechanism: stronger and longer-lasting interactions are more easily and more precisely quantified. We examine the limitations of our approach to stochastic temporal variation, under-reporting, and over-aggregation of data. We propose that likelihood shows promise as a basis for detection and quantification of the effects of pathogen interactions and the determination of their (competitive or cooperative) nature on the basis of population-level time-series data.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Schematics of a two pathogen model with various interaction mechanisms.
Each box represents a possible host state, with individuals formula image categorized according to their status with regards to the two pathogens. Letters formula image, formula image, formula image, and formula image stand for susceptible, infected, convalescent, and recovered, respectively. The horizontal arrows follow the progression of a host's infection due to the first pathogen, and the vertical arrows follow the progression of the second. The diagonal arrows represent disease independent births and deaths. The transitions denoted by red arrows are affected by pathogen interaction.
Figure 2
Figure 2. Phase relation between the two epidemics in the simulation of the model.
Level contours plot the fraction of time epidemics are in-phase [Left], and anti-phase [Right]. Phase difference is calculated by considering 5000 years of simulation (100 years of transients are excluded), computing the fraction of the time series during which strains are in-phase and anti-phase and averaging these fractions over 40 stochastic replicates. Strains are categorized as in phase if the phase difference is less than an eighth of the period, and anti-phase if the difference is formula image of a period. The three points marked (I), (II) and (III) are distinct scenarios examined in our inference tests. Model parameters are as in Table 1, with formula image.
Figure 3
Figure 3. Inference under scenario I: No pathogen interaction.
Inference is carried out for two separate data sets constructed from the same set of parameter values – results are shown in [Left] and [Right] columns for each data set. [Top] Simulated case-data for the two infections are plotted in solid and dashed lines. Log-likelihood profiles for parameters describing the short (formula image) [Middle] and the long term (formula image) [Bottom] interactions. In the insets, we show close-ups of the profiles near the peaks. Plotted formula image are relative difference in the raw log-likelihood from the reference point set at formula image, indicated by the horizontal dashed line. formula image represents the 95% confidence interval–parameter values corresponding to a positive formula image are within the confidence bound. The gray dots indicate the repeated likelihood estimates (formula image replicate SMC calculations for each profile point, formula image particles in each SMC calculation). The profiles are created by fitting a smooth line through the log of the arithmetic mean likelihoods (shown in black dots). The vertical red dashed line is plotted at the actual parameter value used to generate the simulated case-data. Parameters not shown in the graph are taken from Table 1.
Figure 4
Figure 4. Inference under scenario II: Temporary cross-immunity.
Inference is carried out for two separate data sets constructed from the same set of parameter values – results are shown in [Left] and [Right] columns for each data set. [Top] Simulated case-data for the two infections are plotted in solid and dashed lines. Log-likelihood profiles for parameters describing the short (formula image) [Middle] and the long term (formula image) [Bottom] interactions. In the insets, we show close-ups of the profiles near the peaks. Plotted formula image are relative difference in the raw log-likelihood from the reference point set at formula image, indicated by the horizontal dashed line. formula image represents the 95% confidence interval – parameter values corresponding to a positive formula image are within the confidence bound. The gray dots indicate the repeated likelihood estimates (formula image replicate SMC calculations for each profile point, formula image particles in each SMC calculation). The profiles are created by fitting a smooth line through the log of the arithmetic mean likelihoods (shown in black dots). The vertical red dashed line is plotted at the actual parameter value used to generate the simulated case-data. Parameters not shown in the graph are taken from Table 1.
Figure 5
Figure 5. Inference under scenario III: Partial and temporary cross-immunity, and delayed but permanent enhancement.
Inference is carried out for two separate data sets constructed from the same set of parameter values – results are shown in [Left] and [Right] columns for each data set. [Top] Simulated case-data for the two infections are plotted in solid and dashed lines. Log-likelihood profiles for parameters describing the short (formula image) [Middle] and the long term (formula image) [Bottom] interactions. In the insets, we show close-ups of the profiles near the peaks. Plotted formula image are relative difference in the raw log-likelihood from the reference point set at formula image, indicated by the horizontal dashed line. formula image represents the 95% confidence interval – parameter values corresponding to a positive formula image are within the confidence bound. The gray dots indicate the repeated likelihood estimates (formula image replicate SMC calculations for each profile point, formula image particles in each SMC calculation). The profiles are created by fitting a smooth line through the log of the arithmetic mean likelihoods (shown in black dots). The vertical red dashed line is plotted at the actual parameter value used to generate the simulated case-data. Parameters not shown in the graph are taken from Table 1.
Figure 6
Figure 6. Two-parameter profile log-likelihood surfaces: strength and duration of the short term interaction.
formula imageloglik contours (strength of the short term interaction, formula image, on the horizontal axis and the average duration of such interaction, formula image, on the vertical axis) for (top to bottom) scenarios I, II, and III. The red cross indicates the actual parameter values. Darker contours correspond to parameter regions that have higher log-likelihood, and more consistent with the data. Solid white lines show the 95% confidence regions. Parameters as in Table 1.
Figure 7
Figure 7. Inference precision and accuracy as a function of time series length.
We compare the shape of the log-likelihood profiles for short term interaction formula image as the size of the data varies, in three different scenarios. Other parameters are taken from Table 1.
Figure 8
Figure 8. Simultaneous inference of under-reporting and long-term interaction.
Plotted are formula imageloglik contours (strength of the long-term interaction, formula image, on the horizontal axis and the reporting rate, formula image, on the vertical axis). Marked in red crosses are the actual parameter values for formula image and formula image. Darker contours correspond to parameter regions that have higher log-likelihood. Solid white lines show the 95% confidence regions. For this figure, short term interaction formula image. Parameters not shown in the graph are taken from Table 1. See Fig. S3 in the supplementary information (Text S1) for the simulated data and corresponding profiles.
Figure 9
Figure 9. Inference under scenario III with aggregated data.
Inference is carried out for two separate data sets constructed from the same set of parameter values – results are shown in [Left] and [Right] columns for each data set. [Top] These are the same data sets used to make Fig. 5. For each data set, the two time series are added together to form a single aggregated time series. Log-likelihood profiles for parameters describing the short (formula image) [Middle] and the long term (formula image) [Bottom] interactions. In the insets, we show close-ups of the profiles near the peaks. Plotted formula image are relative difference in the raw log-likelihood from the reference point set at formula image, indicated by the horizontal dashed line. formula image represents the 95% confidence interval – parameter values corresponding to a positive formula image are within the confidence bound. The gray dots indicate the repeated likelihood estimates (formula image replicate SMC calculations for each profile point, formula image particles in each SMC calculation). The profiles are created by fitting a smooth line through the log of the arithmetic mean likelihoods (shown in black dots). The vertical red dashed line is plotted at the actual parameter value used to generate the simulated case-data. Parameters not shown in the graph are taken from Table 1.

References

    1. Anderson RM, May RM. Infectious Diseases of Humans; Dynamics and Control. Oxford: Oxford University Press; 1991.
    1. Ferguson N, Anderson R, Garnett G. Mass vaccination to control chickenpox: the inuence of zoster. Proc Natl Acad Sci U S A. 1996;93:7231–5. - PMC - PubMed
    1. Grenfell B, Bjornstad O, Kappey J. Travelling waves and spatial hierarchies in measles epidemics. Nature. 2001;414:716–723. - PubMed
    1. Keeling M, Rohani P. Modelling Infectious Diseases. Princeton: Princeton University Press; 2008.
    1. Fenton A, Pedersen A. Community epidemiology framework for classifying disease threats. Emerg Infect Dis. 2005;11:1815–1821. - PMC - PubMed

Publication types