Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 28;23(5):2662.
doi: 10.3390/s23052662.

A Self-Adaptive Gallery Construction Method for Open-World Person Re-Identification

Affiliations

A Self-Adaptive Gallery Construction Method for Open-World Person Re-Identification

Sara Casao et al. Sensors (Basel). .

Abstract

Person re-identification, or simply re-id, is the task of identifying again a person who has been seen in the past by a perception system. Multiple robotic applications, such as tracking or navigate-and-seek, use re-identification systems to perform their tasks. To solve the re-id problem, a common practice consists in using a gallery with relevant information about the people already observed. The construction of this gallery is a costly process, typically performed offline and only once because of the problems associated with labeling and storing new data as they arrive in the system. The resulting galleries from this process are static and do not acquire new knowledge from the scene, which is a limitation of the current re-id systems to work for open-world applications. Different from previous work, we overcome this limitation by presenting an unsupervised approach to automatically identify new people and incrementally build a gallery for open-world re-id that adapts prior knowledge with new information on a continuous basis. Our approach performs a comparison between the current person models and new unlabeled data to dynamically expand the gallery with new identities. We process the incoming information to maintain a small representative model of each person by exploiting concepts of information theory. The uncertainty and diversity of the new samples are analyzed to define which ones should be incorporated into the gallery. Experimental evaluation in challenging benchmarks includes an ablation study of the proposed framework, the assessment of different data selection algorithms that demonstrate the benefits of our approach, and a comparative analysis of the obtained results with other unsupervised and semi-supervised re-id methods.

Keywords: incremental clustering; open-world recognition; person recognition.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Figures

Figure 1
Figure 1
Simplified comparison between a large static gallery, traditionally used, and our small self-adaptive gallery. Both have a set of images representing each identity (ID0, ID1, …), i.e., each person. The traditional gallery is the same for every person query that arrives at different times (ti, tf). However, because the adaptive gallery is being built and updated as new data arrives, we can appreciate a more comprehensive gallery for later times (ti<tn<tf).
Figure 2
Figure 2
Self-adaptive gallery construction method overview. The person bounding box undergoes a pre-processing where the sample features are obtained with existing deep neural network encoders. Then, the proposed method analyzes the features obtained to decide which ones are used to adapt and evolve the gallery with the new information.
Figure 3
Figure 3
Gallery optimization. Upper area: example simplified where C1 exceeds the memory budget and the gallery optimization selects the feature with the maximum cost to be dropped, x4. Lower area: visual sample of two appearance models from similar identities that are correctly separated in the DukeMTMC-VideoReID dataset. The yellow edge corresponds to identity 86 and the orange edge to identity 194, both ground truth identities.
Figure 4
Figure 4
Parameter evaluation in the gallery construction process using the MARS dataset: (a) effect of the weight assigned to uncertainty and diversity (γ); (b) influence of the expansion threshold (τ); (c) effect of the minimum size to create a class and the memory budget (l/m).
Figure 5
Figure 5
Evolution over time of the metrics with the final parameters set in the gallery construction process using the MARS dataset .
Figure 6
Figure 6
Data selection method comparison with the MARS dataset. We analyze (a) the number of classes created (x-axis) per GT-ID in the dataset (y-axis) showing the number of GT-ID with more than one class associated or those GT-ID that have not been correctly found, i.e., 0 classes associated; (b) gallery structure metrics: F1, precision, and recall; (c) sample classification F1 analyzing the influence of varying K; (d) class precision.
Figure 7
Figure 7
Visualization of the evolution of appearance models in the gallery. Each row corresponds to gallery samples at a certain time. The different colors represent the time stamp of the samples included in the gallery (best viewed in color).

References

    1. Wu D., Zheng S.J., Zhang X.P., Yuan C.A., Cheng F., Zhao Y., Lin Y.J., Zhao Z.Q., Jiang Y.L., Huang D.S. Deep learning-based methods for person re-identification: A comprehensive review. Neurocomputing. 2019;337:354–371. doi: 10.1016/j.neucom.2019.01.079. - DOI
    1. Ye M., Shen J., Lin G., Xiang T., Shao L., Hoi S.C. Deep learning for person re-identification: A survey and outlook. IEEE Trans. Pattern Anal. Mach. Intell. 2021;44:2872–2893. doi: 10.1109/TPAMI.2021.3054775. - DOI - PubMed
    1. De Langis K., Sattar J. Realtime multi-diver tracking and re-identification for underwater human-robot collaboration; Proceedings of the International Conference on Robotics and Automation; Paris, France. 31 May–31 August 2020; pp. 11140–11146.
    1. Chang F.M., Lian F.L., Chou C.C. Integration of modified inverse observation model and multiple hypothesis tracking for detecting and tracking humans. IEEE Trans. Autom. Sci. Eng. 2015;13:160–170. doi: 10.1109/TASE.2015.2426712. - DOI
    1. Truong X.T., Ngo T.D. Toward socially aware robot navigation in dynamic and crowded environments: A proactive social motion model. IEEE Trans. Autom. Sci. Eng. 2017;14:1743–1760. doi: 10.1109/TASE.2017.2731371. - DOI

LinkOut - more resources