Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 14;19(3):e0296810.
doi: 10.1371/journal.pone.0296810. eCollection 2024.

Estimating household contact matrices structure from easily collectable metadata

Affiliations

Estimating household contact matrices structure from easily collectable metadata

Lorenzo Dall'Amico et al. PLoS One. .

Abstract

Contact matrices are a commonly adopted data representation, used to develop compartmental models for epidemic spreading, accounting for the contact heterogeneities across age groups. Their estimation, however, is generally time and effort consuming and model-driven strategies to quantify the contacts are often needed. In this article we focus on household contact matrices, describing the contacts among the members of a family and develop a parametric model to describe them. This model combines demographic and easily quantifiable survey-based data and is tested on high resolution proximity data collected in two sites in South Africa. Given its simplicity and interpretability, we expect our method to be easily applied to other contexts as well and we identify relevant questions that need to be addressed during the data collection procedure.

PubMed Disclaimer

Conflict of interest statement

CCo has received grant support from Sanofi Pasteur, US CDC, Welcome Trust, Programme for Applied Technologies in Health (PATH), Bill & Melinda Gates Foundation and South African Medical Research Council (SA-MRC). NW reports receiving grants from Sanofi Pasteur, US CDC and the Bill & Melinda Gates Foundation. This does not alter our adherence to PLOS ONE policies on sharing data and materials. All other authors do not report any competing interests.

Figures

Fig 1
Fig 1. Data collection schedule for the 60 selected households.
Each row corresponds to a household with the rural site on the left and the urban site on the right. Time is displayed on the x axis and dates are reported in the day/month format. Vertical gray lines correspond to the beginning and end of each deployment. A black dot indicates that at least one contact was measured, while a white one that no contact was recorded on that day.
Fig 2
Fig 2. Properties of the measured data.
a: normalized contact matrix across the three deployments. The color code refers to the values of the logarithm of Rcounts whose entries are proportional to the ratio between the number of contacts and the number of possible interacting pairs, setting the mean of Rcounts to 1. The two axis correspond to the age groups and the number reported indicates the highest age of each group. b: contact duration distribution expressed as number of seconds of interaction across the three deployments in logarithmic scale.
Fig 3
Fig 3. Test of the model for household interaction.
a: histogram of the cosine similarity between C and its estimators. The gray curve corresponds to the histogram over the 2500 realization of X using T as an estimator of C. The orange curve is obtained with the first order model of Eq (2), while the blue curve corresponds to the second order model of Section A second order model for household interaction. b, c: correlation between the fluctuations of the activity δ(u), the group average degree δ(η) and the presence of a major occupation outside the house δ(y). The quantities δa,c are defined in Eq (3). The Pearson correlation coefficient r is reported in text.
Fig 4
Fig 4. Measured vs estimated normalized contact matrices in the two sites.
The first row, in blue, corresponds to Agincourt, the rural site, while the second, in purple, to Klerksdorp, the urban site. The first column shows the matrix C aggregated over the three deployments, as measured by the proximity sensors. The second column is the corresponding random encounter matrix T. The third and the fourth are the estimates obtained by our first and second order models, respectively. All matrices are normalized by the empirical average of their entries.

References

    1. Wallinga J, Edmunds WJ, Kretzschmar M. Perspective: human contact patterns and the spread of airborne infectious diseases. TRENDS in Microbiology. 1999;7(9):372–377. doi: 10.1016/S0966-842X(99)01546-2 - DOI - PubMed
    1. Anderson RM, May RM. Infectious diseases of humans: dynamics and control. Oxford university press; 1992.
    1. Meyers L. Contact network epidemiology: Bond percolation applied to infectious disease prediction and control. Bulletin of the American Mathematical Society. 2007;44(1):63–86. doi: 10.1090/S0273-0979-06-01148-7 - DOI
    1. Verity R, Okell LC, Dorigatti I, Winskill P, Whittaker C, Imai N, et al.. Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet infectious diseases. 2020;20(6):669–677. doi: 10.1016/S1473-3099(20)30243-7 - DOI - PMC - PubMed
    1. Walker PG, Whittaker C, Watson OJ, Baguelin M, Winskill P, Hamlet A, et al.. The impact of COVID-19 and strategies for mitigation and suppression in low-and middle-income countries. Science. 2020;369(6502):413–422. doi: 10.1126/science.abc0035 - DOI - PMC - PubMed