Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 May;474(2213):20170844.
doi: 10.1098/rspa.2017.0844. Epub 2018 May 23.

Data-driven forecasting of high-dimensional chaotic systems with long short-term memory networks

Affiliations

Data-driven forecasting of high-dimensional chaotic systems with long short-term memory networks

Pantelis R Vlachas et al. Proc Math Phys Eng Sci. 2018 May.

Abstract

We introduce a data-driven forecasting method for high-dimensional chaotic systems using long short-term memory (LSTM) recurrent neural networks. The proposed LSTM neural networks perform inference of high-dimensional dynamical systems in their reduced order space and are shown to be an effective set of nonlinear approximators of their attractor. We demonstrate the forecasting performance of the LSTM and compare it with Gaussian processes (GPs) in time series obtained from the Lorenz 96 system, the Kuramoto-Sivashinsky equation and a prototype climate model. The LSTM networks outperform the GPs in short-term forecasting accuracy in all applications considered. A hybrid architecture, extending the LSTM with a mean stochastic model (MSM-LSTM), is proposed to ensure convergence to the invariant measure. This novel hybrid method is fully data-driven and extends the forecasting capabilities of LSTM networks.

Keywords: Gaussian processes; Lorenz 96; T21 barotropic climate model; data-driven forecasting; long short-term memory.

PubMed Disclaimer

Conflict of interest statement

We declare we have no competing interests.

Figures

Figure 1.
Figure 1.
(a) A recurrent neural network cell, where D denotes a delay. The hidden cell state ht depends on the input it and its previous value ht−1. The output ot depends on the hidden state. The weight matrices are parameters of the cell. (b) A recurrent neural network unfolded in time (unfolding the delay). The same weights are used at each time step to compute the output ot that depends on the current input it and short-term history (recursively) encoded in ht−1.
Figure 2.
Figure 2.
Iterative prediction using the trained LSTM model. A short-term history of the system, i.e. z1true,,zdtrue, is assumed to be known. Initial LSTM states are h0,C0. The trained LSTM is used predict the derivative z˙dpred=Fw(zd:1true,h0,C0). The state prediction zd+1pred is obtained by integrating this derivative. This value is used for the next prediction in an iterative fashion. After d time-steps only predicted values are fed in the input. In stateless LSTM, h and C are initialized to zero before every prediction. (Online version in colour.)
Figure 3.
Figure 3.
Lorenz 96 contour plots for different forcing regimes F. Chaoticity rises with bigger values of F.
Figure 4.
Figure 4.
Energy spectrum Ek and cumulative energy with respect to the number of most energetic modes used for different forcing regimes of Lorenz 96 system. As the forcing increases, more chaoticity is introduced to the system. (Online version in colour.)
Figure 5.
Figure 5.
(ac) Short-term RMSE evolution of the most energetic mode for forcing regimes F=4,8,16, respectively, of the Lorenz 96 system. (df) Long-term RMSE evolution. (gi) Evolution of the ACC coefficient (in all plots average over 1000 initial conditions is reported). (Online version in colour.)
Figure 6.
Figure 6.
RMSE prediction error evolution of four energetic modes for the Lorenz 96 system with forcing F=8. (a) Most energetic mode k=8, (b) low-energy mode k=9, (c) low-energy mode k=10 and (d) low-energy mode k=11 (in all plots average over 1000 initial conditions reported). (Online version in colour.)
Figure 7.
Figure 7.
(a) Ratio of the ensemble members evaluated using the LSTM model over time for different Lorenz 96 forcing regimes in the hybrid LSTM–MSM method and (b) the same for GPR in the hybrid GPR–MSM method (average over 500 initial conditions). (Online version in colour.)
Figure 8.
Figure 8.
(a) Contour plots of the solution u(x,t) of the Kuramoto–Sivashinsky system for different values of ν in steady state. Chaoticity rises with smaller values of ν. (b) Cumulative energy as a function of the number of the PCA modes for different values of ν. (Online version in colour.)
Figure 9.
Figure 9.
(a,b) RMSE evolution of the most energetic mode of the K-S equation with 1/ν=10 and 1/ν=16. (c), (d) ACC evolution of the most energetic mode of the K-S equation with 1/ν=10 and 1/ν=16 (in all plots, average value over 1000 initial conditions is reported). (Online version in colour.)
Figure 10.
Figure 10.
(a) Mean of the Barotropic model at statistical steady state. (b) Variance of the Barotropic model at statistical steady state. (c) Percentage of energy explained with respect to the modelled modes. (Online version in colour.)
Figure 11.
Figure 11.
RMSE evolution of the four most energetic EOFs for the Barotropic climate model, average over 500 initial conditions reported. (a) Most energetic EOF, (b) second most energetic EOF, (c) third most energetic EOF and (d) fourth most energetic EOF. (Online version in colour.)

References

    1. Rowley CW. 2005. Model reduction for fluids, using balanced proper orthogonal decomposition. Int. J. Bifurcation Chaos 15, 997–1013. (doi:10.1142/S0218127405012429) - DOI
    1. Williams MO, Kevrekidis IG, Rowley CW. 2015. A data-driven approximation of the Koopman operator: extending dynamic mode decomposition. J. Nonlinear Sci. 25, 1307–1346. (doi:10.1007/s00332-015-9258-5) - DOI
    1. Tu JH, Rowley CW, Luchtenburg DM, Brunton SL, Kutz JN. 2014. On dynamic mode decomposition: theory and applications. J. Comput. Dyn. 1, 391–421. (doi:10.3934/jcd.2014.1.391) - DOI
    1. Kutz JN, Fu X, Brunton SL. 2016. Multiresolution dynamic mode decomposition. SIAM. J. Appl. Dyn. Syst. 15, 713–735. (doi:10.1137/15M1023543) - DOI
    1. Arbabi H, Mezic I. 2017. Ergodic theory, dynamic mode decomposition and computation of spectral properties of the Koopman operator. SIAM. J. Appl. Dyn. Syst. 16, 2096–2126. (doi:10.1137/17M1125236) - DOI