Photonic reinforcement learning based on optoelectronic reservoir computing
- PMID: 35260595
- PMCID: PMC8904492
- DOI: 10.1038/s41598-022-07404-z
Photonic reinforcement learning based on optoelectronic reservoir computing
Abstract
Reinforcement learning has been intensively investigated and developed in artificial intelligence in the absence of training data, such as autonomous driving vehicles, robot control, internet advertising, and elastic optical networks. However, the computational cost of reinforcement learning with deep neural networks is extremely high and reducing the learning cost is a challenging issue. We propose a photonic on-line implementation of reinforcement learning using optoelectronic delay-based reservoir computing, both experimentally and numerically. In the proposed scheme, we accelerate reinforcement learning at a rate of several megahertz because there is no required learning process for the internal connection weights in reservoir computing. We perform two benchmark tasks, CartPole-v0 and MountanCar-v0 tasks, to evaluate the proposed scheme. Our results represent the first hardware implementation of reinforcement learning based on photonic reservoir computing and pave the way for fast and efficient reinforcement learning as a novel photonic accelerator.
© 2022. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures





Similar articles
-
Adaptive model selection in photonic reservoir computing by reinforcement learning.Sci Rep. 2020 Jun 22;10(1):10062. doi: 10.1038/s41598-020-66441-8. Sci Rep. 2020. PMID: 32572093 Free PMC article.
-
Parallel and deep reservoir computing using semiconductor lasers with optical feedback.Nanophotonics. 2022 Oct 17;12(5):869-881. doi: 10.1515/nanoph-2022-0440. eCollection 2023 Mar. Nanophotonics. 2022. PMID: 39634361 Free PMC article.
-
Transfer learning for photonic delay-based reservoir computing to compensate parameter drift.Nanophotonics. 2022 Oct 18;12(5):949-961. doi: 10.1515/nanoph-2022-0399. eCollection 2023 Mar. Nanophotonics. 2022. PMID: 39634352 Free PMC article.
-
Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning.Sensors (Basel). 2021 Feb 11;21(4):1278. doi: 10.3390/s21041278. Sensors (Basel). 2021. PMID: 33670109 Free PMC article. Review.
-
Recent advances in physical reservoir computing: A review.Neural Netw. 2019 Jul;115:100-123. doi: 10.1016/j.neunet.2019.03.005. Epub 2019 Mar 20. Neural Netw. 2019. PMID: 30981085 Review.
Cited by
-
Input-Output-Improved Reservoir Computing Based on Duffing Resonator Processing Dynamic Temperature Compensation for MEMS Resonant Accelerometer.Micromachines (Basel). 2023 Jan 8;14(1):161. doi: 10.3390/mi14010161. Micromachines (Basel). 2023. PMID: 36677222 Free PMC article.
-
High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit.Nat Commun. 2024 Feb 5;15(1):1044. doi: 10.1038/s41467-024-45305-z. Nat Commun. 2024. PMID: 38316815 Free PMC article.
References
-
- Andrae A, Edler T. On global electricity usage of communication technology: trends to 2030. Challenges. 2015;6:117–157.
-
- Haghighat MH, Li J. Intrusion detection system using voting-based neural network. Tsinghua Sci. Technol. 2021;26:484–495.
-
- Zhang J, Xu Q. Attention-aware heterogeneous graph neural network. Big Data Min. Anal. 2021;4:233–241.
-
- Bie Y, Yang Y. A multitask multiview neural network for end-to-end aspect-based sentiment analysis. Big Data Min. Anal. 2021;4:195–207.
-
- Sutton RS, Barto AG. Reinforcement Learning: An Introduction. Cambridge: The MIT Press; 2018.
Grants and funding
LinkOut - more resources
Full Text Sources