Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 25:12:e81752.
doi: 10.7554/eLife.81752.

Approximating missing epidemiological data for cervical cancer through Footprinting: A case study in India

Affiliations

Approximating missing epidemiological data for cervical cancer through Footprinting: A case study in India

Irene Man et al. Elife. .

Abstract

Local cervical cancer epidemiological data essential to project the context-specific impact of cervical cancer preventive measures are often missing. We developed a framework, hereafter named Footprinting, to approximate missing data on sexual behaviour, human papillomavirus (HPV) prevalence, or cervical cancer incidence, and applied it to an Indian case study. With our framework, we (1) identified clusters of Indian states with similar cervical cancer incidence patterns, (2) classified states without incidence data to the identified clusters based on similarity in sexual behaviour, (3) approximated missing cervical cancer incidence and HPV prevalence data based on available data within each cluster. Two main patterns of cervical cancer incidence, characterized by high and low incidence, were identified. Based on the patterns in the sexual behaviour data, all Indian states with missing data on cervical cancer incidence were classified to the low-incidence cluster. Finally, missing data on cervical cancer incidence and HPV prevalence were approximated based on the mean of the available data within each cluster. With the Footprinting framework, we approximated missing cervical cancer epidemiological data and made context-specific impact projections for cervical cancer preventive measures, to assist public health decisions on cervical cancer prevention in India and other countries.

Keywords: HPV prevalence; cervical cancer incidence; classification; clustering; epidemiology; global health; impact projection; sexual behaviour; viruses.

PubMed Disclaimer

Conflict of interest statement

IM, DG, MB, IB No competing interests declared

Figures

Figure 1.
Figure 1.. Hierarchical structure of availability of cervical cancer epidemiological data.
Figure 2.
Figure 2.. Identified clusters of registry-specific cervical cancer incidence.
Clusterings under (A) 2, (B) 3, and (C) 4 prefixed clusters. Each panel within a row corresponds to a cluster within a k-clustering, with the cluster label given on top of the panel. The cervical cancer incidence data were extracted from volume XI of Cancer Incidence in Five Continents (CI5) (Bray et al., 2017) and the 2012–2016 report by the Indian National Centre for Disease Informatics and Research (NCDIR) (Report of National Cancer Registry Programme, 2020). Black: cluster mean of cervical cancer incidence; dark grey: registry incidence assigned to the cluster; light grey: registry incidence assigned to other clusters.
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Registry-specific cervical cancer incidence data from Cancer Incidence in Five Continents (CI5) and National Centre for Disease Informatics and Research (NCDIR).
See Figure 1—source data 1 for whether registries belong to CI5 or NCDIR.
Figure 2—figure supplement 2.
Figure 2—figure supplement 2.. Mean age-specific cervical cancer incidence by cluster.
Figure 3.
Figure 3.. Sexual behaviour data from National AIDS Control Organization (NACO) by Indian state.
Indian state-specific data on (A) median age of first sex, (B) proportion of respondents reporting sex with non-regular partners in the last 12 months, (C) proportion of male respondents reporting sex with commercial partners in the last 12 months, and (D) proportion of male respondents by number of commercial partners in the last 12 months. Each violin plot and the associated cloud of circles correspond to a sexual behaviour variable. Each circle corresponds to the data of a state (or group of states). The data were extracted from the 2006 National Behaviour Surveillance Survey of the National AIDS Control Organization of India (National Behavioural Surveillance Survey: General Population, 2006). Blue and red: Indian states identified in the high and low cervical cancer incidence clusters. Grey: states without cervical cancer incidence data and therefore unknown cluster.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Indian state-specific sexual behaviour data from National AIDS Control Organization (NACO).
States or groups of states as reported in the 2006 National Behaviour Surveillance Survey of the National AIDS Control Organization of India. Other North Eastern states include Arunachal Pradesh, Nagaland, Meghalaya, Mizoram, and Tripura.

Update of

  • doi: 10.1101/2022.06.28.22276994

Similar articles

Cited by

References

    1. Baussano I, Lazzarato F, Brisson M, Franceschi S. Human papillomavirus vaccination at a time of changing sexual behavior. Emerging Infectious Diseases. 2016;22:18–23. doi: 10.3201/eid2201.150791. - DOI - PMC - PubMed
    1. Bonjour M, Charvat H, Franco EL, Piñeros M, Clifford GM, Bray F, Baussano I. Global estimates of expected and preventable cervical cancers among girls born between 2005 and 2014: a birth cohort analysis. The Lancet. Public Health. 2021;6:e510–e521. doi: 10.1016/S2468-2667(21)00046-3. - DOI - PMC - PubMed
    1. Bouvard V, Wentzensen N, Mackie A, Berkhof J, Brotherton J, Giorgi-Rossi P, Kupets R, Smith R, Arrossi S, Bendahhou K, Canfell K, Chirenje ZM, Chung MH, Del Pino M, de Sanjosé S, Elfström M, Franco EL, Hamashima C, Hamers FF, Herrington CS, Murillo R, Sangrajrang S, Sankaranarayanan R, Saraiya M, Schiffman M, Zhao F, Arbyn M, Prendiville W, Indave Ruiz BI, Mosquera-Metcalfe I, Lauby-Secretan B. The IARC perspective on cervical cancer screening. The New England Journal of Medicine. 2021;385:1908–1918. doi: 10.1056/NEJMsr2030640. - DOI - PubMed
    1. Bray F, Mery L, Piñeros M. Cancer incidence in five continents. Ci5. 2017 http://ci5.iarc.fr
    1. Breiman L. Random forests. Machine Learning. 2001;45:5–32. doi: 10.1023/A:1010933404324. - DOI

Publication types