Solving high-dimensional partial differential equations using deep learning

Jiequn Han et al. Proc Natl Acad Sci U S A. 2018 Aug 21;115(34):8505-8510. doi: 10.1073/pnas.1718942115. Epub 2018 Aug 6.

Abstract

Developing algorithms for solving high-dimensional partial differential equations (PDEs) has long been an exceedingly difficult task, owing to the notorious "curse of dimensionality." This paper introduces a deep learning-based approach that can handle general high-dimensional parabolic PDEs. To this end, the PDEs are reformulated using backward stochastic differential equations (BSDEs), and the gradient of the unknown solution is approximated by neural networks, very much in the spirit of deep reinforcement learning, with the gradient acting as the policy function. Numerical results on examples including the nonlinear Black–Scholes equation, the Hamilton–Jacobi–Bellman (HJB) equation, and the Allen–Cahn equation suggest that the proposed algorithm is quite effective in high dimensions, in terms of both accuracy and cost. This opens up possibilities in economics, finance, operational research, and physics, by considering all participating agents, assets, resources, or particles together at the same time, instead of making ad hoc assumptions about their interrelationships.
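To make the reformulation concrete, below is a minimal Python/PyTorch sketch of one loss evaluation of the deep BSDE method as described in the abstract: the forward SDE is discretized with an Euler–Maruyama scheme, one network per time step approximates the (scaled) gradient of the solution, and training minimizes the mismatch with the terminal condition. All names (deep_bsde_loss, grad_nets, mu, sigma, f, g) are illustrative placeholders, not the authors' code.

import torch

def deep_bsde_loss(x0, mu, sigma, f, g, grad_nets, y0, T, N):
    # x0        : (batch, d) tensor of initial points
    # mu, sigma : drift and diffusion of the forward SDE (diagonal
    #             diffusion assumed for simplicity, each (batch, d))
    # f         : nonlinearity of the PDE, f(t, x, y, z)
    # g         : terminal condition g(x)
    # grad_nets : N networks; grad_nets[n](x) approximates
    #             sigma^T grad u(t_n, x)
    # y0        : trainable parameter of shape (1,) approximating u(0, x0)
    batch, d = x0.shape
    dt = T / N
    x, y = x0, y0.expand(batch)
    for n in range(N):
        t = n * dt
        dw = torch.randn(batch, d) * dt ** 0.5     # Brownian increments
        z = grad_nets[n](x)                        # (batch, d)
        # discretized BSDE: Y_{n+1} = Y_n - f(t, X, Y, Z) dt + Z . dW
        y = y - f(t, x, y, z) * dt + (z * dw).sum(dim=1)
        # Euler-Maruyama step for the forward SDE
        x = x + mu(t, x) * dt + sigma(t, x) * dw
    # training minimizes the mismatch with the terminal condition
    return ((y - g(x)) ** 2).mean()

A stochastic-gradient optimizer applied to this loss trains y0 and all grad_nets jointly, so the estimate of u(0, x0) is read off directly from y0 after training.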

Keywords: Feynman–Kac; backward stochastic differential equations; deep learning; high dimension; partial differential equations.


Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Plot of θ_{u_0} as an approximation of u(t=0, x=(100, …, 100)) against the number of iteration steps, in the case of the 100-dimensional nonlinear Black–Scholes equation with 40 equidistant time steps (N=40) and learning rate 0.008. The shaded area depicts the mean ± the SD of θ_{u_0} as an approximation of u(t=0, x=(100, …, 100)) for five independent runs. The deep BSDE method achieves a relative error of size 0.46% in a runtime of 1,607 s.
Fig. 2.
(Top) Relative error of the deep BSDE method for u(t=0, x=(0, …, 0)) when λ=1, against the number of iteration steps, in the case of the 100-dimensional HJB Eq. 13 with 20 equidistant time steps (N=20) and learning rate 0.01. The shaded area depicts the mean ± the SD of the relative error for five different runs. The deep BSDE method achieves a relative error of size 0.17% in a runtime of 330 s. (Bottom) Optimal cost u(t=0, x=(0, …, 0)) against different values of λ in the case of the 100-dimensional HJB Eq. 13, obtained by the deep BSDE method and by classical Monte Carlo simulations of Eq. 14.
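The Monte Carlo baseline in the bottom panel can be reproduced from an explicit representation of the HJB solution. The sketch below assumes that Eq. 14 is the Cole–Hopf/Feynman–Kac formula u(t, x) = -(1/λ) ln E[exp(-λ g(x + √2 W_{T-t}))], and that T=1 and g(x) = ln((1 + |x|²)/2) as in the paper's HJB example; these details are assumptions stated here, not taken from the caption.

import numpy as np

def hjb_reference(x, lam, g, T=1.0, t=0.0, n_samples=10**5, seed=0):
    # Monte Carlo estimate of
    #   u(t, x) = -(1/lam) * ln E[ exp(-lam * g(x + sqrt(2) * W_{T-t})) ]
    rng = np.random.default_rng(seed)
    w = rng.standard_normal((n_samples, x.shape[0])) * np.sqrt(T - t)
    return -np.log(np.mean(np.exp(-lam * g(x + np.sqrt(2.0) * w)))) / lam

# terminal condition assumed from the paper's HJB example
g = lambda x: np.log(0.5 * (1.0 + np.sum(x ** 2, axis=-1)))
u0 = hjb_reference(np.zeros(100), lam=1.0, g=g)  # compare with Fig. 2, Bottom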
Fig. 3.
(Top) Relative error of the deep BSDE method for u(t=0.3, x=(0, …, 0)) against the number of iteration steps, in the case of the 100-dimensional Allen–Cahn Eq. 15 with 20 equidistant time steps (N=20) and learning rate 0.0005. The shaded area depicts the mean ± the SD of the relative error for five different runs. The deep BSDE method achieves a relative error of size 0.30% in a runtime of 647 s. (Bottom) Time evolution of u(t, x=(0, …, 0)) for t ∈ [0, 0.3] in the case of the 100-dimensional Allen–Cahn Eq. 15, computed by means of the deep BSDE method.
Fig. 4.
Illustration of the network architecture for solving semilinear parabolic PDEs, with H hidden layers for each subnetwork and N time intervals. The whole network has (H+1)(N−1) layers in total that involve free parameters to be optimized simultaneously. Each column for t = t_1, t_2, …, t_{N−1} corresponds to a subnetwork at time t. h_n^1, …, h_n^H are the intermediate neurons in the subnetwork at time t = t_n for n = 1, 2, …, N−1.
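One possible rendering of this architecture in code is sketched below, continuing the PyTorch sketch after the Abstract: N−1 subnetworks of H hidden layers each, plus plain trainable parameters for u(0, x_0) and its gradient at t_0, since x_0 is fixed and needs no subnetwork there. The activation choice, layer widths, and all class and helper names are assumptions, and details such as normalization between layers are omitted.

import torch
from torch import nn

def make_subnet(d, hidden, H):
    # one subnetwork of Fig. 4: H hidden layers mapping X_{t_n} in R^d
    # to an approximation of sigma^T grad u(t_n, X_{t_n}) in R^d
    layers, width = [], d
    for _ in range(H):
        layers += [nn.Linear(width, hidden), nn.ReLU()]
        width = hidden
    layers.append(nn.Linear(width, d))            # output layer
    return nn.Sequential(*layers)                 # H + 1 weight layers

class DeepBSDENet(nn.Module):
    # N - 1 subnetworks (t = t_1, ..., t_{N-1}); together with the H + 1
    # weight layers per subnetwork this gives the (H+1)(N-1) parameterized
    # layers counted in the caption
    def __init__(self, d, hidden, H, N):
        super().__init__()
        self.y0 = nn.Parameter(torch.zeros(1))    # approximates u(0, x0)
        self.z0 = nn.Parameter(torch.zeros(1, d)) # gradient at t = t_0
        self.subnets = nn.ModuleList(
            [make_subnet(d, hidden, H) for _ in range(N - 1)])

net = DeepBSDENet(d=100, hidden=110, H=2, N=20)   # sizes are illustrative

In a rollout such as deep_bsde_loss above, net.z0 would supply the gradient at t_0 and net.subnets[n-1] the gradient at t_n for n = 1, …, N−1.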

