Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 22:6:1129370.
doi: 10.3389/frai.2023.1129370. eCollection 2023.

Gamma and vega hedging using deep distributional reinforcement learning

Affiliations

Gamma and vega hedging using deep distributional reinforcement learning

Jay Cao et al. Front Artif Intell. .

Abstract

We show how reinforcement learning can be used in conjunction with quantile regression to develop a hedging strategy for a trader responsible for derivatives that arrive stochastically and depend on a single underlying asset. We assume that the trader makes the portfolio delta-neutral at the end of each day by taking a position in the underlying asset. We focus on how trades in options can be used to manage gamma and vega. The option trades are subject to transaction costs. We consider three different objective functions. We reach conclusions on how the optimal hedging strategy depends on the trader's objective function, the level of transaction costs, and the maturity of the options used for hedging. We also investigate the robustness of the hedging strategy to the process assumed for the underlying asset.

Keywords: D4PG; delta-neutral; derivatives; gamma; hedging; quantile regression; reinforcement learning; vega.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
RL architecture for actor-critic learning and proposed networks.
Figure 2
Figure 2
Comparison of gain distribution for delta hedging, delta-gamma-neutral hedging, and hedging by the VaR95 RL agent when transaction cost = 1%. Note that the 5th percentile of the gain distribution corresponds to the 95th percentile loss in Table 1.
Figure 3
Figure 3
Risk-return trade-offs that are possible using RL with different objective functions. The positive average cost of hedging reported in previous tables means the average return of the agent is negative.

References

    1. Barth-Maron G., Hoffman M. W., Budden D., Dabney W., Horgan D., Tb D. (2018). Distributed distributional deterministic policy gradients. arXiv. 1804.08617.
    1. Bellemare M. G., Dabney W., Munos R. (2017). “A distributional perspective on reinforcement learning,” in ICML'17 in proceedings of the 34th international conference on machine learning, vol. 70, 449–458. - PubMed
    1. Black F., Scholes M. (1973). The pricing of options and corporate liabilities. J. Polit. Econ. 81, 637–659. 10.1086/260062 - DOI - PubMed
    1. Buehler H., Gonon L., Teichmann J., Wood B. (2019). Deep hedging. Quant. Finan. 19, 1271–1291. 10.1080/14697688.2019.1571683 - DOI
    1. Cao J., Chen J., Hull J., Poulos Z. (2021). Deep hedging of derivatives using reinforcement learning. J. Finan. Data Sci. Winter 3, 10–27. 10.3905/jfds.2020.1.052 - DOI

LinkOut - more resources