. 2023 Dec 10;42(28):5160-5188.

doi: 10.1002/sim.9906. Epub 2023 Sep 27.

Estimating seroconversion rates accounting for repeated infections by approximate Bayesian computation

Peter F M Teunis¹, Yuke Wang¹, Kristen Aiemjoy^{2

3}, Mirjam Kretzschmar^{4

5}, Marc Aerts⁶

Affiliations

¹ Hubert Department of Global Health, Center for Global Safe WASH, Rollins School of Public Health, Emory University, Atlanta, Georgia, USA.
² Department of Public Health Sciences, Division of Epidemiology, University of California, Davis, California, USA.
³ Department of Microbiology and Immunology, Mahidol University Faculty of Tropical Medicine, Bangkok, Thailand.
⁴ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
⁵ Center for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, The Netherlands.
⁶ Center for Statistics (CenStat), University Hasselt, Hasselt, Belgium.

PMID: 37753713
PMCID: PMC10842067
DOI: 10.1002/sim.9906

Estimating seroconversion rates accounting for repeated infections by approximate Bayesian computation

Peter F M Teunis et al. Stat Med. 2023.

. 2023 Dec 10;42(28):5160-5188.

doi: 10.1002/sim.9906. Epub 2023 Sep 27.

Authors

Peter F M Teunis¹, Yuke Wang¹, Kristen Aiemjoy^{2

3}, Mirjam Kretzschmar^{4

5}, Marc Aerts⁶

Affiliations

¹ Hubert Department of Global Health, Center for Global Safe WASH, Rollins School of Public Health, Emory University, Atlanta, Georgia, USA.
² Department of Public Health Sciences, Division of Epidemiology, University of California, Davis, California, USA.
³ Department of Microbiology and Immunology, Mahidol University Faculty of Tropical Medicine, Bangkok, Thailand.
⁴ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
⁵ Center for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, The Netherlands.
⁶ Center for Statistics (CenStat), University Hasselt, Hasselt, Belgium.

PMID: 37753713
PMCID: PMC10842067
DOI: 10.1002/sim.9906

Abstract

This study presents a novel approach for inferring the incidence of infections by employing a quantitative model of the serum antibody response. Current methodologies often overlook the cumulative effect of an individual's infection history, making it challenging to obtain a marginal distribution for antibody concentrations. Our proposed approach leverages approximate Bayesian computation to simulate cross-sectional antibody responses and compare these to observed data, factoring in the impact of repeated infections. We then assess the empirical distribution functions of the simulated and observed antibody data utilizing Kolmogorov deviance, thereby incorporating a goodness-of-fit check. This new method not only matches the computational efficiency of preceding likelihood-based analyses but also facilitates the joint estimation of antibody noise parameters. The results affirm that the predictions generated by our within-host model closely align with the observed distributions from cross-sectional samples of a well-characterized population. Our findings mirror those of likelihood-based methodologies in scenarios of low infection pressure, such as the transmission of pertussis in Europe. However, our simulations reveal that in settings of higher infection pressure, likelihood-based approaches tend to underestimate the force of infection. Thus, our novel methodology presents significant advancements in estimating infection incidence, thereby enhancing our understanding of disease dynamics in the field of epidemiology.

Keywords: approximate Bayesian computation; empirical distribution function; reinfection; seroincidence.

PubMed Disclaimer

Figures

**Fig. A1:**
Predicted seroresponse for the longitudinal model fitted to pertussis antibody data.

**Fig. A2:**
Posterior predictive density estimates of the peak antibody level and the time to peak, calculated from the estimated parameters $(μ_{0}, μ_{1}, c^{*})$ and $y_{0}$ .

**Fig. A3:**
(a) Relation between serum antibody baseline $y_{0}$ and subsequent peak level $y_{1} = f (y_{0})$ . (b) Distribution of baseline threshold for a subsequent “small jump” in seroresponse.

**Fig. A4:**
Simulated seroresponse of a hypothetical subject, from birth to age 80 (years), infections occurring as a Poisson process with rate 0.2/yr (a,b) and 1/yr (c,d). Longitudinal parameters fitted to pertussis data (a,c): parameters and baseline chosen at random at each new infection. This corresponds with published analyses. (b,d): parameters $(μ_{0}, μ_{1}, c, α, r)$ and baseline $y_{0}$ chosen at birth and kept fixed. Baseline antibody level $y_{0}$ for subsequent infections carried over from the prior episode. Triangles indicate symptomatic (“large jump”: red) or asymptomatic seroconversions (“small jump”: blue).

**Fig. B1:**
Prior (gray) and posterior (black) densities for simulated cross–sectional data. Estimated seroconversion rate $λ$ , and the two noise parameters $(μ, σ)$ . (a), (b), (c): $λ_{s i m} = 0.03 (1 / y r)$ ; (d), (e), (f): $λ_{s i m} = 0.013 (1 / y r)$ . The prior for $l o g (λ)$ was $N (l o g (0.1), 5.0)$ ; for $μ$ this was $N (0.1, 0.5)$ and for $\log (σ) N (l o g (0.6), 0.5)$ .

**Fig. B2:**
Prior (gray) and posterior (black) densities for cross–sectional data from the Netherlands (ESEN study). Estimated seroconversion rate $λ$ , and the two noise parameters $(μ, σ)$ . (a), (b), (c): ages 5 – 10yr; (d), (e), (f): ages 55 – 60 yr. The prior for $l o g (λ)$ was $N (l o g (1.0), 2.0)$ ; for $μ$ this was $N (1.0, 1.0)$ and for $\log (σ) N (l o g (1.0), 1.0)$ .

**Fig. B3:**
Densities for population samples of data of the Netherlands from the ESEN study, and densities from samples fitted by matching EDFs.

**Fig. B4:**
ESEN data, (a) Estimated seroconversion rates $λ_{est}$ by (5 yr) age categories. Two fitting methods are shown. ML: maximum likelihood seroincidence with B–noise fixed $ν = 3.0 I U / m l$ . ML adj: maximum likelihood seroincidence with B–noise adjusted to the 95th percentile of the distribution estimated by EDF. KS: EDF method simulating vaccination at age 2, using $p_{K S}$ , fitted by ABC jointly estimating $λ$ and the two noise parameters. (b) Estimated log mean $μ$ of B–noise. (c) Estimated $\log s d σ$ of B–noise.

**Fig. B5:**
ESEN data, (a) Estimated seroconversion rates $λ_{est}$ by (5 yr) age categories. Two fitting methods are shown. ML: maximum likelihood seroincidence with B–noise fixed $ν = 3.0 I U / m l$ . ML adj: maximum likelihood seroincidence with B–noise adjusted to the 95th percentile of the distribution estimated by EDF. KS: EDF method simulating vaccination at age 2, using $p_{K S}$ , fitted by ABC jointly estimating $λ$ and the two noise parameters. (b) Estimated log mean $μ$ of B–noise. (c) Estimated $\log s d σ$ of B–noise.

**Fig. B6:**
Kolmogorov probabilities for EDFs of antibody samples generated from posterior parameter estimates in approximate Bayesian computation. Simulated cross–sectional data without (a) and with (b) vaccination at age 2. Observed cross–sectional population data for the Netherlands from the ESEN study for pertussis, analysed in 10 yr age categories, results in Figures 9 and B5.

**Fig. B7:**
(a) Variation in $λ$ . A subject sampled at age 10 was exposed to an outbreak 5 years ago, causing $λ$ to increase from a baseline 0.05 (1/yr) to a peak infection rate of 10 (1/yr). Duration of the outbreak was 465 days, approximately. (b) sample of intervals for this nonhomogeneous Poisson process $(n = 25)$ .

**Fig. B8:**
Estimated $λ_{est}$ and B–noise parameters $(μ_{est}, σ_{est})$ for an outbreak 2, 5, 10, 20 and 50 years ago $(Δ)$ at time of sampling, for subjects aged 0 – 80 years (a – d) or 0 – 10 years (e–h) (uniform age distributions). Baseline infection rate $λ_{0} = 0.05 (1 / y r)$ , during the outbreak a peak rate of $λ_{1} = 10 (1 / y r)$ is reached. Simulated B–noise distribution parameters: $(μ_{s i m}, σ_{s i m}) = (0.1, 0.6)$ . ML: maximum likelihood estimation; KS: EDF based method. Note how recent changes in $λ$ cause decreased $p_{K S} (d, h)$ .

**Figure 1:**
Simulated seroresponse of a hypothetical subject, from birth to age 80 (years), infections occurring as a Poisson process with rate 0.2/yr. Longitudinal parameters fitted to pertussis data: $(μ_{0}, μ_{1}, c, α, r)$ chosen at birth and kept fixed. The baseline antibody level $y_{0}$ is low at birth. After the first infection $y_{0}$ is carried over from each prior episode for any further infections. Triangles indicate symptomatic (“large jump”: red) or asymptomatic seroconversions (“small jump”: blue).

**Figure 2:**
Fitting $λ$ : output for simulated data for subjects 0–80 yrs of age with baseline distribution parameters: (0.1, 0.6) and seroconversion rate $λ_{sim} = 0.001$ and 0.05 (1/yr). (a) Probability density of (simulated) observed serum antibody distribution and best fitting (minimum $D_{K S}$ or $D_{K L}$ ) simulated distributions. (b) Scaled deviates as a function $λ_{est}$ of the simulated sample of antibody concentrations. (c) and (d): corresponding KS dev (Kolmogorov–Smirnov deviate $D_{K S}$ ); $K L$ div (Kullback–Leibler divergence $D_{K L}$ ); AD dev (Anderson–Darling deviate $A^{2}$ ); KS prob (Kolmogorov probability $p_{K S})$ as a function of $λ$ .

**Figure 3:**
Fitting B–noise distribution parameters: output for simulated data for subjects 0 – 80yrs of age with baseline distribution parameters: (0.1, 0.6) and seroconversion rate $λ_{s i m} = 0.001$ and 0.05 (1/yr). (a) Scaled deviates as a function of the B–noise log mean $μ_{e s t}$ at $λ_{s i m} = 0.001 (1 / y r)$ . (b) Scaled deviates as a function of the B–noise $l o g s d σ_{est}$ at $λ_{sim} = 0.001 (1 / y r)$ . (c) and (d) Same at $λ_{sim} = 0.05 (1 / y r)$ . KS dev: Kolmogorov–Smirnov deviate $D_{K S}$ ; $K L$ div: Kullback–Leibler divergence $D_{K L}$ ; AD dev: Anderson–Darling deviate $A^{2}$ ; KS prob: Kolmogorov probability $p_{K S}$ .

**Figure 4:**
Estimated $λ_{est}$ and B–noise parameters $(μ_{est}, σ_{est})$ for a range of simulated $λ_{sim}$ ranging from 0.001 to 10 (1/yr) in subjects 0 – 80yrs of age and B–noise distribution parameters: $(μ_{sim}, σ_{sim}) = (0.1, 0.6)$ . Baseline $y_{0} = y (t_{inf})$ from previous infections, generating seroresponses as in Figure 1. (a) Two methods for estimation of $λ$ are compared: ML: maximum likelihood using the published seroincidence method with fixed B–noise parameter adjusted to the 95 th percentile of simulated noise $(ν = 2.97 I U / m l)$ , and $K S$ : EDF based method using $p_{K S}$ to jointly estimate $(λ, μ, σ)$ . (b) B–noise log mean $μ_{est}$ and (c) $l o g s d σ_{est}$ , estimated jointly with $λ_{est}$ using ABC. Dashed lines indicate simulated values: $λ_{est} = λ_{sim},$ $μ_{est} = μ_{sim} = 0.1,$ $σ_{est} = σ_{sim} = 0.6$ .

**Figure 5:**
Bias in $λ_{est}$ due to baseline “memory”. (a) Simulated cross–sectional sample generated with $y_{0} = y (t_{inf})$ ; $λ_{est}$ calculated by ML and EDF fitting (KS) with fixed baseline $(y_{0} = y (0))$ and instantaneous seroconversion $(t_{1} = 0)$ . (b) Simulated cross–sectional sample generated with fixed $y_{0} = y (0)$ and instantaneous seroconversion $(t_{1} = 0); λ_{est}$ calculated by ML and EDF fitting (KS) with infection history $(y_{0} = y (t_{inf}))$ and seroconversion $(t_{1} > 0)$ .

**Figure 6:**
Estimated $λ_{est}$ and B–noise parameters $(μ_{est}, σ_{est})$ for a simulated population vaccinated at age 2. a–c: $λ_{est}$ and B–noise parameters $(μ_{est}, σ_{est})$ estimated using a model including vaccination (KS). d–f: same parameters estimated using a model without vaccination at age 2 (KS). For comparison, likelihood based estimated are also shown (ML).

**Figure 7:**
ESEN data from the Netherlands: estimates of $λ$ and probability density of antibody levels. Age 35–40 yr (a): $D_{K S}$ (KS dev) and associated probability $p_{K S}$ (KS prob) as a function of $λ_{est}$ . Also shown $D_{K L}$ ( $K L$ div) and Anderson–Darling $A^{2}$ (AD dev). (b): probability densities of observed antibody levels and minimum $D_{K S}$ (maximum $p_{K S}$ ) sample, and minimum $D_{K L}$ sample.

**Figure 8:**
ESEN data, (a) Estimated seroconversion rates $λ_{est}$ by (5yr) age categories. Two fitting methods are shown. ML: maximum likelihood seroincidence with B–noise fixed $ν = 3.0 I U / m l$ . ML adj: maximum likelihood seroincidence with B–noise adjusted to the 95th percentile of the distribution estimated by EDF. KS: EDF method using $p_{K S}$ , fitted by ABC jointly estimating $λ$ and the two noise parameters. (b) Estimated log mean $μ$ of B–noise. (c) Estimated $\log s d σ$ of B–noise.

**Figure 9:**
ESEN data, (a) Estimated seroconversion rates $λ_{est}$ by (5 yr) age categories. Two fitting methods are shown. ML: maximum likelihood seroincidence with B–noise fixed $ν = 3.0 I U / m l$ . ML adj: maximum likelihood seroincidence with B–noise adjusted to the 95th percentile of the distribution estimated by EDF. KS: EDF method using $p_{K S}$ , fitted by ABC jointly estimating $λ$ and the two noise parameters. (b) Estimated log mean $μ$ of B–noise. (c) Estimated $\log s d σ$ of B–noise.

See this image and copyright information in PMC

References

1. de Greeff SC, Teunis P, de Melker HE, et al. Two-component cluster analysis of a large serodiagnostic database for specificity of increases of IgG antibodies against pertussis toxin in paired serum samples and of absolute values in single serum samples. Clinical and Vaccine Immunology 2012;19(9):1452–1456. doi:10.1128/CVI.00229-12. - DOI - PMC - PubMed
1. Konda T, Kamachi K, Iwaki M, Matsunaga Y. Distribution of pertussis antibodies among different age groups in Japan. Vaccine 2002;20:1711–1717. - PubMed
1. Nardone A, Pebody RG, Maple PAC, Andrews N, Gay NJ, Miller E. Sero-epidemiology of Bordetella pertussis infections in England and Wales. Vaccine 2004;22(9–10):1314–1319. doi:10.1016/j.vaccine.2003.08.039. - DOI - PubMed
1. Peasey AE, Ruiz-Palacios GM, Quigley M, et al. Seroepidemiology and risk factors for sporadic norovirus/Mexico strain. Journal of Infectious Diseases 2004;189(11):2027–2036. - PubMed
1. Teunis PFM, van Eijkeren JCH, Ang CW, et al. Biomarker dynamics: estimating infection rates from serological data. Statistics in Medicine 2012;31(20):2240–2248. doi:10.1002/sim.5322. - DOI - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- PubMed Central
- eScholarship, University of California - Access Free Full Text

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Estimating seroconversion rates accounting for repeated infections by approximate Bayesian computation

Affiliations

Estimating seroconversion rates accounting for repeated infections by approximate Bayesian computation

Authors

Affiliations

Abstract

Figures

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources