Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 3;11(6):857.
doi: 10.3390/biology11060857.

A Novel Approach to Modeling and Forecasting Cancer Incidence and Mortality Rates through Web Queries and Automated Forecasting Algorithms: Evidence from Romania

Affiliations

A Novel Approach to Modeling and Forecasting Cancer Incidence and Mortality Rates through Web Queries and Automated Forecasting Algorithms: Evidence from Romania

Cristiana Tudor. Biology (Basel). .

Abstract

Cancer remains a leading cause of worldwide mortality and is a growing, multifaceted global burden. As a result, cancer prevention and cancer mortality reduction are counted among the most pressing public health issues of the twenty-first century. In turn, accurate projections of cancer incidence and mortality rates are paramount for robust policymaking, aimed at creating efficient and inclusive public health systems and also for establishing a baseline to assess the impact of newly introduced public health measures. Within the European Union (EU), Romania consistently reports higher mortality from all types of cancer than the EU average, caused by an inefficient and underfinanced public health system and lower economic development that in turn have created the phenomenon of "oncotourism". This paper aims to develop novel cancer incidence/cancer mortality models based on historical links between incidence and mortality occurrence as reflected in official statistics and population web-search habits. Subsequently, it employs estimates of the web query index to produce forecasts of cancer incidence and mortality rates in Romania. Various statistical and machine-learning models-the autoregressive integrated moving average model (ARIMA), the Exponential Smoothing State Space Model with Box-Cox Transformation, ARMA Errors, Trend, and Seasonal Components (TBATS), and a feed-forward neural network nonlinear autoregression model, or NNAR-are estimated through automated algorithms to assess in-sample fit and out-of-sample forecasting accuracy for web-query volume data. Forecasts are produced with the overperforming model in the out-of-sample context (i.e., NNAR) and fed into the novel incidence/mortality models. Results indicate a continuation of the increasing trends in cancer incidence and mortality in Romania by 2026, with projected levels for the age-standardized total cancer incidence of 313.8 and the age-standardized mortality rate of 233.8 representing an increase of 2%, and, respectively, 3% relative to the 2019 levels. Research findings thus indicate that, under the no-change hypothesis, cancer will remain a significant burden in Romania and highlight the need and urgency to improve the status quo in the Romanian public health system.

Keywords: ARIMA; Google Trends; NNAR; Romania; TBATS; cancer; forecasting; incidence; modeling; mortality.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Most common queries related to the search term “cancer”: worldwide (April 2017–March 2022). Source of data: Google Trends. Estimation results using the “gtrendsR” package [19] in R software.
Figure 2
Figure 2
Internet search interest for “cancer” at the world level: (April 2017–March 2022). Source of data: Google Trends. Map is based on estimation results and uses the packages “gtrendsR” [19] and “tmap” [21] in R software.
Figure 3
Figure 3
Trends in cancer mortality rates in selected CEE countries (2011–2018). Estimation results. Plot created in R software (“ggplots” function). Source of data: Eurostat.
Figure 4
Figure 4
Trends in age-standardized cancer incidence and mortality rates in Romania (2010–2019) (panel a); join-points in cancer incidence rate (panel b). Source of data: Romanian Ministry of Health (2021) [32]. Chart in panel (a) is produced in Datawrapper. Chart in panel (b) is produced with the “ggplot” function in R software; join-point regression analysis is performed with the “segmented” package within R software.
Figure 4
Figure 4
Trends in age-standardized cancer incidence and mortality rates in Romania (2010–2019) (panel a); join-points in cancer incidence rate (panel b). Source of data: Romanian Ministry of Health (2021) [32]. Chart in panel (a) is produced in Datawrapper. Chart in panel (b) is produced with the “ggplot” function in R software; join-point regression analysis is performed with the “segmented” package within R software.
Figure 5
Figure 5
The integrated framework for modeling and forecasting cancer incidence and mortality rates.
Figure 6
Figure 6
The relationship (linear—blue line, polynomial—orange line) between related web queries and the age-standardized cancer incidence rate in Romania (panel a). The relationship (linear—blue line, polynomial—orange line) between related web queries and the age-standardized cancer mortality rate in Romania (panel b). Source of data: Romanian Ministry of Health [32]. All estimations were performed in R software; plots were created in R software (i.e.,“ggplot” function).
Figure 7
Figure 7
Forecasted trend over April 2022–March 2026 (48 months) for web queries for the term “cancer” in Romania issued with NNAR (12,6). Source: estimation results. Model information: average of 20 networks, each of which is a 12-6-1 network with 85 weight options.

Similar articles

Cited by

References

    1. World Health Organization (WHO) Cancer. 2022. [(accessed on 4 April 2022)]. Available online: https://www.who.int/news-room/fact-sheets/detail/cancer.
    1. World Cancer Research Fund Worldwide Cancer Data. 2022. [(accessed on 30 March 2022)]. Available online: https://www.wcrf.org/dietandcancer/worldwide-cancer-data/
    1. World Health Organization (WHO) Cancer. 2019. [(accessed on 4 April 2022)]. Available online: https://www.who.int/health-topics/cancer#tab=tab_1.
    1. Ma X., Yu H. Cancer issue: Global burden of cancer. Yale J. Biol. Med. 2006;79:85. - PMC - PubMed
    1. Nagai H., Kim Y.H. Cancer prevention from the perspective of global cancer burden patterns. J. Thorac. Dis. 2017;9:448. doi: 10.21037/jtd.2017.02.75. - DOI - PMC - PubMed