Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 12;20(2):e1011810.
doi: 10.1371/journal.pcbi.1011810. eCollection 2024 Feb.

Generating synthetic population for simulating the spatiotemporal dynamics of epidemics

Affiliations

Generating synthetic population for simulating the spatiotemporal dynamics of epidemics

Kemin Zhu et al. PLoS Comput Biol. .

Abstract

Agent-based models have gained traction in exploring the intricate processes governing the spread of infectious diseases, particularly due to their proficiency in capturing nonlinear interaction dynamics. The fidelity of agent-based models in replicating real-world epidemic scenarios hinges on the accurate portrayal of both population-wide and individual-level interactions. In situations where comprehensive population data are lacking, synthetic populations serve as a vital input to agent-based models, approximating real-world demographic structures. While some current population synthesizers consider the structural relationships among agents from the same household, there remains room for refinement in this domain, which could potentially introduce biases in subsequent disease transmission simulations. In response, this study unveils a novel methodology for generating synthetic populations tailored for infectious disease transmission simulations. By integrating insights from microsample-derived household structures, we employ a heuristic combinatorial optimizer to recalibrate these structures, subsequently yielding synthetic populations that faithfully represent agent structural relationships. Implementing this technique, we successfully generated a spatially-explicit synthetic population encompassing over 17 million agents for Shenzhen, China. The findings affirm the method's efficacy in delineating the inherent statistical structural relationship patterns, aligning well with demographic benchmarks at both city and subzone tiers. Moreover, when assessed against a stochastic agent-based Susceptible-Exposed-Infectious-Recovered model, our results pinpointed that variations in population synthesizers can notably alter epidemic projections, influencing both the peak incidence rate and its onset.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The framework of population synthesis.
The basemap shapefile can be accessed at https://www.kaggle.com/datasets/keminzhu/basemap-shenzhen-subzones.
Fig 2
Fig 2. Distribution of household structure in survey data.
(a) Probability/Cumulative Density Function. (b) Frequency-rank and Liner Regression.
Fig 3
Fig 3. Changes in the objective function value with the iteration number.
Fig 4
Fig 4. Spatial distribution for full population by age group across subregions.
The basemap shapefile can be accessed at https://www.kaggle.com/datasets/keminzhu/basemap-shenzhen-subzones.
Fig 5
Fig 5. Age distributions obtained from the demographic data and synthetic population at the subzone level, where each point represents the number of people in a certain age group within a subzone.
Fig 6
Fig 6. Comparison of marginal distributions obtained from the demographic data and synthetic population for household size, age, and gender.
Fig 7
Fig 7. Comparison of the joint distribution of age-gender obtained from the survey dataset and synthetic population.
Fig 8
Fig 8. Comparison of the motif distribution in the synthetic populations generated by different methods, with motifs ordered according to the survey data.
Fig 9
Fig 9. Comparison of interdependency distributions in simulated populations using different methods with those in the survey data, where the value of each cell represents the average frequency of the corresponding cross-age relationship in households.
Fig 10
Fig 10. Framework of the agent-based epidemic model.
(a) The compartmental model used to describe the natural history of the infectious disease between the states. (b) Schematic illustration of the weighted multilayer contact network. Details of the epidemic model and the transitions between compartments are provided in the S1 Text.
Fig 11
Fig 11. Epidemic curve simulated with a different synthetic population.

Similar articles

Cited by

References

    1. Alzu’bi AA, Alasal SIA, Watzlaf VJ. A simulation study of coronavirus as an epidemic disease using agent-based modeling. Perspectives in health information management. 2021;18(Winter). - PMC - PubMed
    1. Hunter E, Mac Namee B, Kelleher J. An open-data-driven agent-based model to simulate infectious disease outbreaks. PloS one. 2018;13(12):e0208775. doi: 10.1371/journal.pone.0208775 - DOI - PMC - PubMed
    1. Perez L, Dragicevic S. An agent-based approach for modeling dynamics of contagious disease spread. International journal of health geographics. 2009;8(1):1–17. doi: 10.1186/1476-072X-8-50 - DOI - PMC - PubMed
    1. Venkatramanan S, Lewis B, Chen J, Higdon D, Vullikanti A, Marathe M. Using data-driven agent-based models for forecasting emerging infectious diseases. Epidemics. 2018;22:43–9. doi: 10.1016/j.epidem.2017.02.010 - DOI - PMC - PubMed
    1. Bissett KR, Cadena J, Khan M, Kuhlman CJ. Agent-Based Computational Epidemiological Modeling. Journal of the Indian Institute of Science. 2021;101(3):303–27. doi: 10.1007/s41745-021-00260-2 - DOI - PMC - PubMed