Depth from a Motion Algorithm and a Hardware Architecture for Smart Cameras

Abiel Aguilar-González et al. Sensors (Basel). 2018 Dec 23;19(1):53. doi: 10.3390/s19010053.

Abstract

Applications such as autonomous navigation, robot vision, and autonomous flying require depth map information of a scene. Depth can be estimated with a single moving camera (depth from motion). However, traditional depth-from-motion algorithms have low processing speeds and high hardware requirements that limit their embedded capabilities. In this work, we propose a hardware architecture for depth from motion that consists of a flow/depth transformation and a new optical flow algorithm. Our optical flow formulation is an extension of the stereo matching problem: a pixel-parallel/window-parallel approach in which a correlation function based on the sum of absolute differences (SAD) computes the optical flow. Further, to improve the SAD, we propose the curl of the intensity gradient as a preprocessing step. Experimental results demonstrate that it is possible to reach higher accuracy (90% accuracy) compared with previous Field Programmable Gate Array (FPGA)-based optical flow algorithms. For depth estimation, our algorithm delivers dense maps with motion and depth information for all image pixels, with a processing speed up to 128 times faster than that of previous work, making it possible to achieve high performance in the context of embedded applications.

Keywords: FPGA (Field Programmable Gate Array); depth estimation; monocular systems; optical flow; smart cameras.
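
To make the pipeline in the abstract concrete, the following sketch composes its three stages in plain Python: curl preprocessing, SAD-based optical flow, and the flow/depth transformation. It is a minimal software rendering under stated assumptions, not the authors' FPGA implementation; curl_image, match_pixel, and depth_from_flow are the hypothetical helpers sketched alongside Figures 3, 5, and 6 below.

```python
import numpy as np

def depth_from_motion(frame_t, frame_t1, patch=10, search=10, step=2):
    """Hypothetical end-to-end driver: curl preprocessing, SAD-based
    optical flow, and the flow/depth transformation. curl_image,
    match_pixel, and depth_from_flow are the sketches given with
    Figures 3, 5, and 6 below."""
    c_t, c_t1 = curl_image(frame_t), curl_image(frame_t1)
    h, w = frame_t.shape
    dx = np.zeros((h, w), np.float32)
    dy = np.zeros((h, w), np.float32)
    margin = patch // 2 + search // 2          # keep windows in-bounds
    for y in range(margin, h - margin):
        for x in range(margin, w - margin):
            dx[y, x], dy[y, x] = match_pixel(c_t, c_t1, x, y,
                                             patch, search, step)
    return depth_from_flow(dx, dy)
```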

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Block diagram of the proposed algorithm.
Figure 2
The optical flow step: first, the curl images Curl_t(x,y) and Curl_{t+1}(x,y) are computed. Then, given the curl images for two consecutive frames, the pixel displacements Δx(x,y), Δy(x,y) (the optical flow for all pixels in the reference image) are computed using a dynamic template based on the previously computed optical flow (Δx,t−1(x,y), Δy,t−1(x,y)).
Figure 3
Curl computation example. Input image taken from the KITTI benchmark dataset [35].
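
A minimal NumPy sketch of this curl preprocessing is given below. Note that for purely linear derivative filters the curl of a gradient vanishes identically, so the sketch rectifies the gradient components (absolute value) before the second derivative stage; that rectification, and the finite-difference kernels, are illustrative assumptions rather than the paper's exact filter choice.

```python
import numpy as np

def curl_image(frame):
    """Curl-like texture response sketching the preprocessing of
    Figure 3. For purely linear filters curl(grad I) is identically
    zero, so the gradient components are rectified (abs) before the
    second derivative stage -- an illustrative assumption, not
    necessarily the paper's exact kernel choice."""
    I = frame.astype(np.float32)
    Ix = np.abs(np.gradient(I, axis=1))        # |dI/dx| (columns)
    Iy = np.abs(np.gradient(I, axis=0))        # |dI/dy| (rows)
    # curl of the rectified gradient field: d(Iy)/dx - d(Ix)/dy
    return np.gradient(Iy, axis=1) - np.gradient(Ix, axis=0)
```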
Figure 4
Optical flow example. Flow color coding as proposed in the Tsukuba benchmark dataset [36].
Figure 5
The proposed optical flow algorithm formulation: patch size = 10, search size = 10, and sampling value = 2. For each pixel in the reference image f_t, n overlapped regions are constructed in f_{t+1}, and the region center that minimizes (or maximizes) the similarity metric is taken as the tracked position (flow) of the pixel (x,y) in f_{t+1}.
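
This formulation maps naturally to a block-matching loop. The sketch below is a sequential software rendering of the pixel-parallel/window-parallel search, run over the curl images; the `prior` argument re-centers the search region on the previously computed flow, i.e., the dynamic template of Figure 2. All names are illustrative, and on the FPGA the candidate windows are evaluated in parallel rather than in a loop.

```python
import numpy as np

def sad(a, b):
    """Sum of absolute differences between two equally sized patches."""
    return np.sum(np.abs(a - b))

def match_pixel(curl_t, curl_t1, x, y, patch=10, search=10, step=2,
                prior=(0, 0)):
    """Track pixel (x, y) from frame t to frame t+1. Candidate windows
    in curl_t1 are sampled every `step` pixels inside a `search`-sized
    region centered at (x, y) + prior, where `prior` is the flow from
    time t-1 (the dynamic template of Figure 2). Returns the
    displacement (dx, dy) whose window minimizes the SAD score."""
    h = patch // 2
    ry, rx = y - h, x - h
    ref = curl_t[ry:ry + patch, rx:rx + patch]
    cx, cy = x + prior[0], y + prior[1]
    best, best_dxy = np.inf, (0, 0)
    for dy in range(-search // 2, search // 2 + 1, step):
        for dx in range(-search // 2, search // 2 + 1, step):
            y0, x0 = cy + dy - h, cx + dx - h
            if y0 < 0 or x0 < 0:               # window fell off the image
                continue
            cand = curl_t1[y0:y0 + patch, x0:x0 + patch]
            if cand.shape != ref.shape:
                continue
            score = sad(ref, cand)
            if score < best:
                best, best_dxy = score, (cx + dx - x, cy + dy - y)
    return best_dxy
```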
Figure 6
(a) Epipolar geometry: depth in the scene is inversely proportional to the disparity value, i.e., far objects have low disparity values, while closer objects have high disparity values. To compute the disparity map (disparities for all pixels in the image), a stereo pair (two images with epipolar geometry) is needed. (b) Single moving camera: in this work, we assume that depth in the scene is inversely proportional to the pixel velocity over time. To compute the pixel velocity, the optical flow across two consecutive frames has to be computed.
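
Under that assumption, the flow/depth transformation reduces to a per-pixel reciprocal of the flow magnitude, up to an unknown scale. A minimal sketch; the scale k and the epsilon guard are illustrative assumptions:

```python
import numpy as np

def depth_from_flow(flow_dx, flow_dy, k=1.0, eps=1e-6):
    """Flow/depth transformation sketched from Figure 6b: depth taken
    as inversely proportional to the apparent pixel velocity. The
    scale k (unknown without calibration/ego-motion) and the eps guard
    for static pixels are illustrative assumptions."""
    speed = np.hypot(flow_dx, flow_dy)         # per-pixel flow magnitude
    return k / (speed + eps)
```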
Figure 7
Depth estimation using the proposed algorithm.
Figure 8
FPGA architecture for the proposed algorithm.
Figure 9
FPGA architecture for the “frame buffer” unit. Two external memories configured in switching mode make it possible to store the current frame (time t) in a DRAM configured in write mode while another DRAM (in read mode) streams out the pixels of the previous frame (time t−1).
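
The switching scheme is a classic ping-pong (double) buffer. A behavioral model in plain Python, with a two-element list standing in for the DRAM banks (names are illustrative):

```python
class PingPongFrameBuffer:
    """Behavioral model of the switching scheme: two banks alternate
    roles so that frame t is written to one bank while frame t-1 is
    read back from the other."""

    def __init__(self):
        self.banks = [None, None]
        self.write_bank = 0                    # bank receiving frame t

    def push(self, frame):
        """Store frame t; return frame t-1 (None on the first call)."""
        read_bank = 1 - self.write_bank
        previous = self.banks[read_bank]       # frame t-1, read mode
        self.banks[self.write_bank] = frame    # frame t, write mode
        self.write_bank = read_bank            # swap roles next frame
        return previous
```

push() is called once per frame; pairing its return value with the incoming frame yields the (t−1, t) pair consumed by the optical flow unit.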
Figure 10
FPGA architecture for the optical flow estimation.
Figure 11
The circular buffer architecture. For an n×n patch, a shift-mechanism “control” unit manages the read/write addresses of n+1 BRAMs: in each clock cycle, n BRAMs are in read mode and one BRAM is in write mode. The n×n buffer then delivers logic registers holding all pixels within the patch in parallel.
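
In software, the same scheme can be modeled with n+1 row buffers and a write pointer that rotates modulo n+1: each incoming row overwrites the oldest bank while the other n banks expose the sliding n-row window. A behavioral sketch with illustrative names:

```python
import numpy as np

class CircularLineBuffer:
    """Behavioral model of Figure 11: n+1 row buffers (the BRAMs) with
    a rotating write pointer. Per row, one buffer is written while the
    other n are read, exposing an n-row sliding window in parallel."""

    def __init__(self, n, width):
        self.n = n
        self.rows = np.zeros((n + 1, width), dtype=np.uint8)
        self.wr = 0                            # index of the write-mode BRAM

    def push_row(self, row):
        self.rows[self.wr] = row               # one BRAM in write mode
        self.wr = (self.wr + 1) % (self.n + 1)
        # the other n BRAMs, oldest row first, form the read window
        order = [(self.wr + 1 + i) % (self.n + 1) for i in range(self.n)]
        return self.rows[order]                # n x width window
```

The window becomes valid once n rows have been pushed; an n×n patch for the SAD stage is then any n-column slice of the returned window.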
Figure 12
FPGA architecture for the “curl” unit.
Figure 13
FPGA architecture for the “depth estimation” unit.
Figure 14
Accuracy performance for different FPGA-based optical flow algorithms.
Figure 15
Optical flow: quantitative/qualitative results for the KITTI dataset.
Figure 16
Depth estimation: quantitative/qualitative results for the KITTI dataset.
Figure 17
The KITTI dataset, Sequence 00: 3D reconstruction by the proposed approach. Our algorithm provides rough depth maps (with lower accuracy than previous algorithms) but with real-time processing and the capability to be implemented in embedded hardware. As a result, real-time dense 3D reconstructions can be obtained and exploited by several real-world applications, such as augmented reality, robot vision and surveillance, and autonomous flying.

References

    1. Hengstler S., Prashanth D., Fong S., Aghajan H. MeshEye: A hybrid-resolution smart camera mote for applications in distributed intelligent surveillance; Proceedings of the 6th International Conference on Information Processing in Sensor Networks; Cambridge, MA, USA. 25–27 April 2007; pp. 360–369.
    2. Aguilar-González A., Arias-Estrada M. Towards a smart camera for monocular SLAM; Proceedings of the 10th International Conference on Distributed Smart Cameras; Paris, France. 12–15 September 2016; pp. 128–135.
    3. Carey S.J., Barr D.R., Dudek P. Low power high-performance smart camera system based on SCAMP vision sensor. J. Syst. Archit. 2013;59:889–899. doi: 10.1016/j.sysarc.2013.03.016.
    4. Birem M., Berry F. DreamCam: A modular FPGA-based smart camera architecture. J. Syst. Archit. 2014;60:519–527. doi: 10.1016/j.sysarc.2014.01.006.
    5. Bourrasset C., Maggiani L., Sérot J., Berry F., Pagano P. Distributed FPGA-based smart camera architecture for computer vision applications; Proceedings of the 2013 Seventh International Conference on Distributed Smart Cameras (ICDSC); Palm Springs, CA, USA. 29 October–1 November 2013; pp. 1–2.
