Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan 12:7:40703.
doi: 10.1038/srep40703.

A spiking neural network model of 3D perception for event-based neuromorphic stereo vision systems

Affiliations

A spiking neural network model of 3D perception for event-based neuromorphic stereo vision systems

Marc Osswald et al. Sci Rep. .

Erratum in

Abstract

Stereo vision is an important feature that enables machine vision systems to perceive their environment in 3D. While machine vision has spawned a variety of software algorithms to solve the stereo-correspondence problem, their implementation and integration in small, fast, and efficient hardware vision systems remains a difficult challenge. Recent advances made in neuromorphic engineering offer a possible solution to this problem, with the use of a new class of event-based vision sensors and neural processing devices inspired by the organizing principles of the brain. Here we propose a radically novel model that solves the stereo-correspondence problem with a spiking neural network that can be directly implemented with massively parallel, compact, low-latency and low-power neuromorphic engineering devices. We validate the model with experimental results, highlighting features that are in agreement with both computational neuroscience stereo vision theories and experimental findings. We demonstrate its features with a prototype neuromorphic hardware system and provide testable predictions on the role of spike-based representations and temporal dynamics in biological stereo vision processing systems.

PubMed Disclaimer

Figures

Figure 1
Figure 1. The spiking stereo network.
Detailed view of a horizontal layer of the network. An object is sensed by two eyes and accordingly projected onto their retinal cells. The spiking output of these cells is spatio-temporally correlated (coincidence detectors) and integrated (disparity detectors). The final output encodes a representation of the original scene in disparity space (x, y, d). For the sake of visibility, only a horizontal line of retinal cells, at fixed vertical cyclopean position y, is considered. The corresponding coincidence and disparity detector units, hence, lie within a horizontal plane (spanned by x and d). Only a few units are shown here whereas in the complete network, the units are uniformly distributed over the entire plane. The shaded planes indicate how the network expands vertically over y. More details on how the neurons are connected among each other is provided in the Methods section.
Figure 2
Figure 2. Successful resolution of the stereo correspondence problem by the spiking neural network.
The recorded scene consists of two people that move in opposite directions at different depths. Here, the two depth maps were generated by binning the output spikes of the network into 30 ms bins at times t1 and t2 respectively. The corresponding 3D reconstructions (red and green dots) are overlayed with the ground-truth data obtained from a Kinect sensor (gray). The color encodes the polarity, which is obtained from the event-based sensor.
Figure 3
Figure 3. Inhibition of ambiguity in the stereo network.
Spiking activity of coincidence (blue) and disparity (brown) detectors at varying disparities accumulated over the full duration of the walking scene. The inset shows a disparity map generated from a short section of the scene. The two people are labeled (A) and (B) accordingly.
Figure 4
Figure 4. The stereo network’s response to a dynamic random dot stereogram (dRDS).
(A) Schematic of the dRDS stimulus for the left and right eye. (B) Ground-truth disparity image. Disparity is encoded by color ranging from near (red) to far (blue). (C) Disparity map generated from accumulated responses of the network while the dRDS stimulus was presented for 1 second.
Figure 5
Figure 5. Emulation of the stereo network on a neuromorphic processor.
(A) A RDS stimulus (printed on a chart) was moved at specified depths in front of a pair of dynamic vision sensors. The depth of the stimulus was detected by 30 disparity neurons covering an equally spaced disparity range from −10 to +10. Disparity detectors were emulated by a neuromorphic processor. (B) Spike event output of the disparity neurons during stimulus presentation. The spikes from the neuron encoding the true disparity are colored red, while all the others are colored blue. The histogram shown on the right represents the distribution of disparity spikes for the time where the stimulus was at a fixed disparity d = 6 (indicated by the grey shaded region).
Figure 6
Figure 6. The silicon retina.
(A) DAVIS sensor. (B) Principle of ON and OFF spikes generation of DVS pixels, adapted from ref. . Top: the evolution of pixel’s voltage Vp proportional to the log intensity. Bottom: the corresponding generation of ON (voltage increases above change threshold) and OFF (voltage decreases) events, from which the evolution of Vp can be reconstructed.

References

    1. Nityananda V. et al.. Insect stereopsis demonstrated using a 3d insect cinema. Scientific reports 6 (2016). - PMC - PubMed
    1. Tippetts B., Lee D. J., Lillywhite K. & Archibald J. Review of stereo vision algorithms and their suitability for resource-limited systems. Journal of Real-Time Image Processing 1–21 (2013).
    1. Barry B. et al.. Always-on vision processing unit for mobile applications. IEEE Micro 56–66 (2015).
    1. Chicca E., Stefanini F., Bartolozzi C. & Indiveri G. Neuromorphic electronic circuits for building autonomous cognitive systems. Proceedings of the IEEE 102, 1367–1388 (2014).
    1. Neftci E. et al.. Synthesizing cognition in neuromorphic electronic systems. Proceedings of the National Academy of Sciences 110, E3468–E3476 (2013). - PMC - PubMed

Publication types