Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 13;15(1):6898.
doi: 10.1038/s41467-024-51093-3.

Single neuromorphic memristor closely emulates multiple synaptic mechanisms for energy efficient neural networks

Affiliations

Single neuromorphic memristor closely emulates multiple synaptic mechanisms for energy efficient neural networks

Christoph Weilenmann et al. Nat Commun. .

Erratum in

Abstract

Biological neural networks do not only include long-term memory and weight multiplication capabilities, as commonly assumed in artificial neural networks, but also more complex functions such as short-term memory, short-term plasticity, and meta-plasticity - all collocated within each synapse. Here, we demonstrate memristive nano-devices based on SrTiO3 that inherently emulate all these synaptic functions. These memristors operate in a non-filamentary, low conductance regime, which enables stable and energy efficient operation. They can act as multi-functional hardware synapses in a class of bio-inspired deep neural networks (DNN) that make use of both long- and short-term synaptic dynamics and are capable of meta-learning or learning-to-learn. The resulting bio-inspired DNN is then trained to play the video game Atari Pong, a complex reinforcement learning task in a dynamic environment. Our analysis shows that the energy consumption of the DNN with multi-functional memristive synapses decreases by about two orders of magnitude as compared to a pure GPU implementation. Based on this finding, we infer that memristive devices with a better emulation of the synaptic functionalities do not only broaden the applicability of neuromorphic computing, but could also improve the performance and energy costs of certain artificial intelligence applications.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Biologically inspired synaptic functions and their memristor implementation.
a Organization of the mammalian brain with several biological neurons connected through synapses. When a postsynaptic spike (light blue) coincides with a presynaptic spike (light green) the corresponding synaptic coupling is strengthened (Hebbian plasticity) for a limited amount of time (short-term plasticity). This bio-physical process is illustrated in the circular insets: (I) an influx of ions (e.g., Ca2+) through the postsynaptic voltage gated ion channels leads to (II) an increased number of synaptic receptors, which increases the synaptic weight. (III) The weight subsequently decays back to its original value due to the receptors gradually detaching from the membrane. b Table comparing the synaptic functions of artificial synapses in standard ANNs (Artificial column) and biological synapses (Biological column). The plot on the right shows the weight of a biological synapse as a function of time. The short-term weight (F) is updated (ΔF) when the pre- and post-synaptic spikes coincide. Additionally, the decay time of F can be controlled, which corresponds to meta-plasticity. c Bio-inspired Short-Term Plasticity Neuron (STPN) model combining a conventional neuron model with short-term Hebbian (ST-Hebb) synapses. d Hardware implementation of a neuromorphic ST-Hebb synapse with a Cr/Pt-SrTiO3-Ti memristor. The device measurement on the right mirrors the biological functions of ST-Hebb synapses combining memory and computation as well as long- (W) and short-term (F) dynamics.
Fig. 2
Fig. 2. DC and dynamical behavior of multi-functional memristive synapses.
a Conductance vs. voltage characteristic (30 cycles) of the fabricated Cr/Pt-STO-Ti memristors. The black arrows indicate the counter-clockwise switching direction. b Sketch of the device stack and of the underlying switching mechanism. The two insets zoom into the Pt-STO interface at different applied voltages, showing the dynamics of interfacial oxygen ions (O) and oxygen vacancies (VO): (Vapp > 0) At positive voltages VO formation and migration occurs. The negatively charged oxygen migrates towards the interface and into the porous Pt electrode and the positively charged VO move along the electric field away from the Pt electrode and towards the grounded Ti electrode. (Vapp ≤ 0) At zero applied voltage, VO's move back towards the Pt, driven by the built-in electrochemical gradient, where they get filled by oxygen. A negative voltage accelerates this process. c Conductance change from low to high under the application of 100 SET pulses with an amplitude of 4V and a duration of 500 μs. d Time-dependent conductance measurement (read out at 0.6V) when voltage pulses (2V, 2.5V, and 3V) with a duration of 100μs are applied. The pulses induce short-term increases of the conductance with subsequent decay. The long-term conductance (red area) remains constant. e Short-term conductance changes due to the voltage pulses in the dotted rectangle of (d). Only the conductance values during the read voltage are shown here. f Aggregate plot showing short-term plasticity for different values of the long-term weight W. The measurement data was obtained by first applying the protocol in (d) to characterize the short-term plasticity for the minimum long-term weight (W1). The long-term weight was then changed by 100 SET pulses (c) and, after a waiting period of 240s, the short-term plasticity was measured again.
Fig. 3
Fig. 3. Control over the magnitude and dynamics of short-term conductance updates.
a Mean (solid line) and standard deviation (shaded area) of pulse-induced short-term conductance updates (ΔF) from five conductance measurements and using four different pulse voltages (2, 2.5, 3, and 3.5V). The read voltage is set to 0.6V and the pulse width to 100 μs. To better compare the measurements, the conductance values were adjusted by subtracting the initial conductance at t=0 from the data. b Heatmap of the achieved ΔF for the different pulse voltages and widths. c Heatmap of the required pulse energy for the same voltage and width combinations as in (b). d (Top) Applied voltage protocol on a linear x-axis using 0.6V/200 μs read pulses with a period of 700 μs and (bottom) corresponding conductance values shown on a logarithmic x-axis. In between the read pulses a constant bias voltage (Vbias) of variable amplitude is applied. The main pulse voltage and width are set to 3.5V and 500 μs, respectively, in all measurements. The mean and standard deviation of the adjusted conductance values are shown for 5 measurements on a semi-log plot. e Extracted decay time constant Λ from the measurements in d) as a function of Vbias. The experimental data points were fitted with a sigmoid function.
Fig. 4
Fig. 4. Simulation and energy consumption of an STPN network with multi-functional synapses.
a Sketch of the full STPN network. A frame of the Atari game is fed into two convolutional layers: Conv(kernel, stride) plus a ReLU activation function. The features are then fed into the m-STPN layer (blue ellipses and lines). The layer’s output is split into actions and a value by two fully connected linear layers. b, c Average reward as a function of agent steps during training for (b) three different implementations of the STPN layer and (c) five different ranges of Λ. Each curve represents the average reward of 16 agents with different random parameter initialization. The shaded area denotes the standard deviation. In the inset of c), the cases Λ = 0 and Λ = [0.08, 0.92] (i.e., the achievable device range) are shown. d Total synaptic weight (long- and short-term component) of a single synapse of the trained network (SmaxF}) during an entire game. The Zoom-in additionally shows the long-term weight W in red and the ΔF as black bars. e Energy consumed by our memristors due to ΔF updates, i.e., voltage pulses with widths wp, fitted by a power-law. f Power consumed by our memristors due to different Decay bias voltages (Vbias). g Time evolution of the energy consumption of the synapse in (d) during an entire Pong game for a memristor (blue) and a pure GPU implementation (orange). Different energy contributions and the total energy are shown. h Histograms of all synapses in the network, indicating how many synapses consume a specific amount of energy during the whole Pong game. The two contributions ΔF and Decay are shown. For the Decay the worst case scenario: Vbias = 0.6 is assumed for all synapses. i Total energy histogram (ΔF plus Decay).

References

    1. Markram, H., Gerstner, W. & Sjöström, P. J. Spike-timing-dependent plasticity: a comprehensive overview. Front. Synaptic Neurosci.4, 2 (2012). - PMC - PubMed
    1. Erickson, M. A., Maramara, L. A. & Lisman, J. A single brief burst induces GluR1-dependent associative short-term potentiation: a potential mechanism for short-term memory. J. Cogn. Neurosci.22, 2530–2540 (2010). - PMC - PubMed
    1. Zucker, R. S. & Regehr, W. G. Short-term synaptic plasticity. Annu. Rev. Physiol.64, 355–405 (2002). - PubMed
    1. Wang, Y. et al. Heterogeneity in the pyramidal network of the medial prefrontal cortex. Nat. Neurosci.9, 534–542 (2006). - PubMed
    1. Abraham, W. C. & Bear, M. F. Metaplasticity: the plasticity of synaptic plasticity. Trends Neurosci.19, 126–130 (1996). - PubMed

LinkOut - more resources