Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 27:152:e93.
doi: 10.1017/S0950268824000694.

Analysis and forecasting of syphilis trends in mainland China based on hybrid time series models

Affiliations

Analysis and forecasting of syphilis trends in mainland China based on hybrid time series models

Zhen D Wang et al. Epidemiol Infect. .

Abstract

Syphilis remains a serious public health problem in mainland China that requires attention, modelling to describe and predict its prevalence patterns can help the government to develop more scientific interventions. The seasonal autoregressive integrated moving average (SARIMA) model, long short-term memory network (LSTM) model, hybrid SARIMA-LSTM model, and hybrid SARIMA-nonlinear auto-regressive models with exogenous inputs (SARIMA-NARX) model were used to simulate the time series data of the syphilis incidence from January 2004 to November 2023 respectively. Compared to the SARIMA, LSTM, and SARIMA-LSTM models, the median absolute deviation (MAD) value of the SARIMA-NARX model decreases by 352.69%, 4.98%, and 3.73%, respectively. The mean absolute percentage error (MAPE) value decreases by 73.7%, 23.46%, and 13.06%, respectively. The root mean square error (RMSE) value decreases by 68.02%, 26.68%, and 23.78%, respectively. The mean absolute error (MAE) value decreases by 70.90%, 23.00%, and 21.80%, respectively. The hybrid SARIMA-NARX and SARIMA-LSTM methods predict syphilis cases more accurately than the basic SARIMA and LSTM methods, so that can be used for governments to develop long-term syphilis prevention and control programs. In addition, the predicted cases still maintain a fairly high level of incidence, so there is an urgent need to develop more comprehensive prevention strategies.

Keywords: LSTM; NARX; SARIMA; modelling; syphilis.

PubMed Disclaimer

Conflict of interest statement

The authors declare none.

Figures

Figure 1.
Figure 1.
The cell structure of the LSTM network. Note: The arrow indicates the data flow, where x, s, c, f, i, g, and o denote the input, output, cell state, forget gate, input gate, cell candidate, and output gate in time step t, respectively. σ and tanh denote the sigmoid activation function and the hyperbolic tangent function, which maps the data to (0,1) and (−1,1), respectively. formula image are vector operators which represent element-wise multiplication and element-wise addition, respectively
Figure 2.
Figure 2.
Monthly reported cases of syphilis from January 2004 to November 2023 and the decomposition of the TS data. Note: In (a), the blue curve depicts the monthly reported incidences of syphilis, while the red curve illustrates the long-term trend. Meanwhile, in (b), the blue curve represents the stable seasonal component exhibiting a periodicity of 12 months, and the yellow dash curve portrays the time series post-seasonal component extraction
Figure 3.
Figure 3.
SARIMA model residuals normality and autocorrelation diagnostics. Note: (a) The frequency distribution of standardized residuals using a histogram. (b) The QQ plots of residuals of the SARIMA model, the red dashed line represents the standard normal distribution. (c,d) The ACF and PACF of residuals, respectively. The stem plots represent the values of ACF and PACF at different lags, and the blue lines indicate the ±2 times standard deviation interval
Figure 4.
Figure 4.
Fitting and predicting the performance of LSTM (a), SARIMA-LSTM (b), and SARIMA-NARX (c) models with different structures. Note: The dark blue and grey bars represent the RMSE values of the training and test sets, respectively
Figure 5.
Figure 5.
The simulation process for the training set by the SARIMA-NARX model. Note: (a) The variation of MSE for the training, validation, and test sets during the iteration process. (b) The error between the output values of each component data and the target values, while (c) provides a detailed display of the error magnitude. The blue, yellow, and red dots indicate the target values of the training set, validation set, and test set after simulation using the SARIMA-NARX model, and the blue, yellow, and red crosses denote the outputs of the training set, validation set, and test set, and the yellow stem denotes the error of fitting
Figure 6.
Figure 6.
Fitting and forecasting performance of the SARIMA, LSTM, SARIMA-LSTM, and SARIMA-NARX models. Note: Panels (a), (c), (e), and (g) denote the fitting and predicting results using the SARIMA, LSTM, SARIMA-LSTM, and SARIMA-NARX models, respectively, the red and yellow curves represent the simulation values for the train set and test set of the TS. Panels (b), (d), (f), and (h) denote the residuals of the SARIMA, LSTM, SARIMA-LSTM, and SARIMA-NARX models, respectively, the blue and yellow stems represent the residuals for the train sets and test sets, respectively
Figure 7.
Figure 7.
Prediction results from December 2023 to November 2025 of SARIMA-LSTM, and SARIMA-NARX models. Note: The light gray and blue areas respectively represent the forecast intervals of SARIMA-NARX and SARIMA-LSTM. The blue, red, and yellow curves represent the original data, the fitted values of the SARIMA-LSTM and SARIMA-NARX models, and the predicted values of the two models are represented by the red and yellow dashed lines

Similar articles

References

    1. Kenyon C, et al. (2023) Management of asymptomatic sexually transmitted infections in Europe: Towards a differentiated, evidence-based approach. The Lancet Regional Health Europe 34, 100743. - PMC - PubMed
    1. Hook EW 3rd (2017) Syphilis. Lancet (London, England) 389(10078), 1550–1557. - PubMed
    1. WHO (2023) Overview of syphlis. Available at https://wwwwhoint/health-topics/syphilis#tab=tab_1 (accessed 9 August 2023).
    1. Mercuri SR, et al. (2022) Syphilis: A mini review of the history, epidemiology and focus on microbiota. The New Microbiologica 45(1), 28–34. - PubMed
    1. Chauhan K, et al. (2023) Demystifying ocular syphilis - A major review. Ocular Immunology and Inflammation 31(7), 1425–1439. - PubMed