Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Jan 29;23(3):1504.
doi: 10.3390/s23031504.

Person Re-Identification with RGB-D and RGB-IR Sensors: A Comprehensive Survey

Affiliations
Review

Person Re-Identification with RGB-D and RGB-IR Sensors: A Comprehensive Survey

Md Kamal Uddin et al. Sensors (Basel). .

Abstract

Learning about appearance embedding is of great importance for a variety of different computer-vision applications, which has prompted a surge in person re-identification (Re-ID) papers. The aim of these papers has been to identify an individual over a set of non-overlapping cameras. Despite recent advances in RGB-RGB Re-ID approaches with deep-learning architectures, the approach fails to consistently work well when there are low resolutions in dark conditions. The introduction of different sensors (i.e., RGB-D and infrared (IR)) enables the capture of appearances even in dark conditions. Recently, a lot of research has been dedicated to addressing the issue of finding appearance embedding in dark conditions using different advanced camera sensors. In this paper, we give a comprehensive overview of existing Re-ID approaches that utilize the additional information from different sensor-based methods to address the constraints faced by RGB camera-based person Re-ID systems. Although there are a number of survey papers that consider either the RGB-RGB or Visible-IR scenarios, there are none that consider both RGB-D and RGB-IR. In this paper, we present a detailed taxonomy of the existing approaches along with the existing RGB-D and RGB-IR person Re-ID datasets. Then, we summarize the performance of state-of-the-art methods on several representative RGB-D and RGB-IR datasets. Finally, future directions and current issues are considered for improving the different sensor-based person Re-ID systems.

Keywords: RGB–D sensors; RGB–IR sensors; cross-modal; multi-modal; re-identification; video surveillance.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Sample images showing the challenges related to camera variations and environmental conditions in the Re-ID problem. (a) shows pose and viewpoint variations; (b) background clutter; (c) partial occlusion; and (d) illumination variations. Images were taken from the public standard datasets IAS-Lab [14], i-LIDS [15] and Market-1501 [16].
Figure 2
Figure 2
Sample images showing the RGB, depth, and skeleton information of an individual in the RobotPKU RGBD-ID [33] dataset.
Figure 3
Figure 3
General categories of RGB–D and IR sensor-based person re-identification systems.
Figure 4
Figure 4
Different multi-modal person re-identification approaches (feature/score-level fusion), where each approach uses different modalities. (a) RGB and depth modalities; (b) RGB and skeleton information; (c) depth and skeleton information; (d) RGB, depth, and skeleton information; and (e) RGB, depth, and thermal images [44].
Figure 5
Figure 5
A flowchart of the RGB–depth image-based Re-ID framework. (a) Score-level fusion and (b) feature-level fusion techniques.
Figure 6
Figure 6
A flowchart of the RGB–skeleton information-based Re-ID framework.
Figure 7
Figure 7
A flowchart of the depth–skeleton information-based Re-ID framework.
Figure 8
Figure 8
A flowchart of the RGB–depth–skeleton-based Re-ID.
Figure 9
Figure 9
A flowchart of RGB–depth–skeleton-based Re-ID.
Figure 10
Figure 10
Illustration of cross-modal person Re-ID. (a) Sketch–RGB cross-modal Re-ID uses sketch as a query and RGB as a gallery or vice versa. (b) IR–RGB cross-modal Re-ID uses IR as a query and RGB as a gallery or vice versa. (c) Depth–RGB cross-modal Re-ID uses depth as a query and RGB as a gallery or vice versa.
Figure 11
Figure 11
The general architecture of RGB–depth cross-modal Re-ID.
Figure 12
Figure 12
Sample images showing frontal, rear, and side views of an individual, with the camera installed at a horizontal viewpoint.
Figure 13
Figure 13
Sample image taken from the TVPR dataset showing an overhead view of a person.
Figure 14
Figure 14
Sample images showing three modalities (RGB, depth, and thermal) taken from the RGB-D-T dataset.
Figure 15
Figure 15
Example images showing sketches of different persons and their corresponding RGB images.
Figure 16
Figure 16
The left figure shows sample images from the SYSU-MM01 dataset and the right one shows sample images from the RegDB dataset.

References

    1. Gheissari N., Sebastian T.B., Hartley R. Person reidentification using spatiotemporal appearance; Proceedings of the IEEE Computer Society Conference on Computer Vision and pattern Recognition; New York, NY, USA. 17–22 June 2006; pp. 1528–1535.
    1. Islam K. Person search: New paradigm of person re-identification: A survey and outlook of recent works. Image Vis. Comput. 2020;101:103970. doi: 10.1016/j.imavis.2020.103970. - DOI
    1. Zheng L., Yang Y., Hauptmann A.G. Person re-identification: Past, Present and Future. arXiv. 20161610.02984
    1. Wu D., Zheng S.J., Zhang X.P., Yuan C.A., Cheng F., Zhao Y., Lin Y.J., Zhao Z.Q., Jiang Y.L., Huang D.S. Deep learning-based methods for person re-identification: A comprehensive review. Neurocomputing. 2019;337:354–371. doi: 10.1016/j.neucom.2019.01.079. - DOI
    1. Bedagkar-Gala A., Shah S.K. A survey of approaches and trends in person re-identification. Image Vis. Comput. 2014;32:270–286. doi: 10.1016/j.imavis.2014.02.001. - DOI

LinkOut - more resources