Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Feb 28;25(5):1519.
doi: 10.3390/s25051519.

A Survey of Robotic Monocular Pose Estimation

Affiliations
Review

A Survey of Robotic Monocular Pose Estimation

Kun Zhang et al. Sensors (Basel). .

Abstract

Robotic monocular pose estimation is an important part of neural monocular pose estimation-driven methods, which includes monocular simultaneous localization and mapping (SLAM) and single-view object pose estimation (OPE) driven by neural methods. The mapping thread leeches onto robotic monocular pose estimation. Robotic monocular pose estimation consists of the localization part of monocular SLAM and the object pose solving part of single-view OPE. Depth prediction neural networks, semantics, neural implicit representations, and large language models (LLMs) are neural methods that have been important components of neural monocular pose estimation-driven methods. Complete robotic monocular pose estimation is a potential module in real robots. Possible future research directions and applications are discussed.

Keywords: monocular SLAM; neural methods; robotic monocular pose estimation; robots; single-view OPE.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The structure of the survey.
Figure 2
Figure 2
The pipeline of iNeRF-OPE.
Figure 3
Figure 3
An example of robotic grasping with a single object.
Figure 4
Figure 4
The distribution of coarse pose.
Figure 5
Figure 5
The tendency of neural monocular pose estimation-driven methods and semantic plugging.

References

    1. Chang Y., Wang X., Wang J., Wu Y., Yang L., Zhu K., Chen H., Yi X., Wang C., Wang Y., et al. A Survey on Evaluation of Large Language Models. ACM Trans. Intel. Syst. Technol. 2024;15:1–45. doi: 10.1145/3641289. - DOI
    1. Tawhid M., Ludher A.S., Hashim H.A. Stochastic Observer for SLAM on the Lie Group; Proceedings of the Modeling, Estimation and Control Conference (MECC); Austin, TX, USA. 24–27 October 2021.
    1. Engel J., Koltun V., Cremers D. Direct Sparse Odometry. IEEE Trans. Pattern Anal. Mach. Intell. 2018;40:611–625. doi: 10.1109/TPAMI.2017.2658577. - DOI - PubMed
    1. Hashim H.A., Eltoukhy A.E.E. Nonlinear Filter for Simultaneous Localization and Mapping on a Matrix Lie Group Using IMU and Feature Measurements. IEEE Trans. Syst. Man. Cybern. Syst. 2022;52:2098–2109. doi: 10.1109/TSMC.2020.3047338. - DOI
    1. Liu Z., Zhang F. BALM: Bundle Adjustment for Lidar Mapping. IEEE Robot. Autom. Lett. 2021;6:3184–3191. doi: 10.1109/LRA.2021.3062815. - DOI

LinkOut - more resources