Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 10;21(8):2678.
doi: 10.3390/s21082678.

Spatial Memory in a Spiking Neural Network with Robot Embodiment

Affiliations

Spatial Memory in a Spiking Neural Network with Robot Embodiment

Sergey A Lobov et al. Sensors (Basel). .

Abstract

Cognitive maps and spatial memory are fundamental paradigms of brain functioning. Here, we present a spiking neural network (SNN) capable of generating an internal representation of the external environment and implementing spatial memory. The SNN initially has a non-specific architecture, which is then shaped by Hebbian-type synaptic plasticity. The network receives stimuli at specific loci, while the memory retrieval operates as a functional SNN response in the form of population bursts. The SNN function is explored through its embodiment in a robot moving in an arena with safe and dangerous zones. We propose a measure of the global network memory using the synaptic vector field approach to validate results and calculate information characteristics, including learning curves. We show that after training, the SNN can effectively control the robot's cognitive behavior, allowing it to avoid dangerous regions in the arena. However, the learning is not perfect. The robot eventually visits dangerous areas. Such behavior, also observed in animals, enables relearning in time-evolving environments. If a dangerous zone moves into another place, the SNN remaps positive and negative areas, allowing escaping the catastrophic interference phenomenon known for some AI architectures. Thus, the robot adapts to changing world.

Keywords: STDP; cognitive maps; learning; neurorobotics; spiking neural networks; vector field of functional connections; vector field of synaptic connections.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Examples of vector fields defined by synaptic connections in four small neural assemblies. Black arrows show the direction and magnitude of the synaptic vector field in a grid containing 7 × 23 cells (black points correspond to zero vectors). The thickness of the red links connecting neurons is proportional to the weight of interneuron couplings.
Figure 2
Figure 2
Emergence of traveling patches of neural activity and their relation with the vector field of functional connections (black arrow). (A) Spontaneous firing before stimulation (red and blue circles mark excited excitatory and inhibitory neurons, respectively; the time instants tb of the network burst refers to the interval past from the stimulus onset). (B) Network activity during periodic local stimulation. (C) Spontaneous activity after long-term stimulation.
Figure 3
Figure 3
Comparison of the vector fields of anatomical (black arrows) and functional (red arrow) connections during neuronal activity before (A), during (B), and after (C) a period of long-term local stimulation. The intensity of the magenta color of the cells is proportional to the field difference.
Figure 4
Figure 4
Characteristics of global network memory. (A) Top: The vector field of the synaptic weights (thin black arrows) of a segment of the neural network before external stimulation (only the first quadrant of the network is shown). The thick black arrow, gc, shows the global connectivity vector of the selected segment. The magenta-framed inset shows incoming couplings for a representative neuron (line thickness corresponds to the coupling weight). Bottom: A typical epoch of the raster plot during the stimulation (each point displays a spike of the corresponding neuron; neurons have been sorted by their distance to the stimulation site). (B) Same as in (A) but immediately after external stimulation. The thick red arrow, gst, is the vector of global memory after stimulation. (C) Same as in (A) but after a period of forgetting. The cosine of the angle between the connectivity vector at the current time gc (black) and the vector after stimulation gst (dashed, red) is the measure of global network memory.
Figure 5
Figure 5
Correlation of the learning curve with the global network memory measure. (A) Averaged relative synchronization time (learning curve) and memory measure M as functions of the stimulus number. The time interval between successive stimulations is 1043×104 s, the noise intensity D = 4.8 (n = 10). The relative synchronization time is the time required to synchronize the network, normalized to the synchronization time at the first stimulation. M was evaluated just before the stimulation. (B) Same as in (A) but for the increased time interval between stimulations (5×104105 s) and the intensity of neural noise (D = 5.1–5.5; n = 10). (C) Synchronization time versus the global memory measure (n = 200).
Figure 6
Figure 6
Embodiment of the SNN in a moving robot. (A) The robot is controlled by the SNN and moves in a square arena divided into safe and dangerous (marked with a grid) zones. (B) The robot location determines the stimulation area in the network space (red circle). The stimulation frequency is 1 and 10 Hz for the safe and dangerous zones, respectively. (C) The vector field of functional connections (red arrows) in the SNN determines the robot’s direction and speed (red circle delimits the area controlling the robot movement).
Figure 7
Figure 7
Adapting the robot behavior to changing environment. (A) Top: Traces of the robot movements in the arena without dangerous zones (all four quadrants are safe). The robot randomly explored the environment. Bottom: Moving average of the time spent by the robot in quadrants I (blue curve) and III (red curve). (B) Same as in (A) but during movement in the arena with the danger zone in quadrant III. The robot spent little time in the dangerous quadrant (red curve), while the time spent in quadrant I rose (blue curve). (C) Same as in (B) but during relearning. The dangerous zone had moved from quadrant III to I. The robot learned the new dangerous area and recovered the previous one that was now safe. The times spent by the robot in quadrants I and III were inverted (the blue curve decayed, while the red one increased).

Similar articles

Cited by

References

    1. Anokhin K.V. The brain and memory: The biology of traces of time past. Her. Russ. Acad. Sci. 2010;80:237–242. doi: 10.1134/S101933161003007X. - DOI
    1. Frankland P.W., Bontempi B. The organization of recent and remote memories. Nat. Rev. Neurosci. 2005;6:119–130. doi: 10.1038/nrn1607. - DOI - PubMed
    1. Eichenbaum H. Memory: Organization and control. Annu. Rev. Psychol. 2017;68:19–45. doi: 10.1146/annurev-psych-010416-044131. - DOI - PMC - PubMed
    1. Snoddy G.S. Learning and stability: A psychophysiological analysis of a case of motor learning with clinical applications. J. Appl. Psychol. 1926;10:1–36. doi: 10.1037/h0075814. - DOI
    1. Crossman E.R.F.W. A theory of the acquisition of speed-skill. Ergonomics. 1959;2:153–166. doi: 10.1080/00140135908930419. - DOI

LinkOut - more resources