Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jun 27:3:25.
doi: 10.1186/s41235-018-0114-7. eCollection 2018 Dec.

Improving face identification with specialist teams

Affiliations

Improving face identification with specialist teams

Tarryn Balsdon et al. Cogn Res Princ Implic. .

Abstract

People vary in their ability to identify faces, and this variability is relatively stable across repeated testing. This suggests that recruiting high performers can improve identity verification accuracy in applied settings. Here, we report the first systematic study to evaluate real-world benefits of selecting high performers based on performance in standardized face identification tests. We simulated a recruitment process for a specialist team tasked with detecting fraudulent passport applications. University students (n = 114) completed a battery of screening tests followed by a real-world face identification task that is performed routinely when issuing identity documents. Consistent with previous work, individual differences in the real-world task were relatively stable across repeated tests taken 1 week apart (r = 0.6), and accuracy scores on screening tests and the real-world task were moderately correlated. Nevertheless, performance gains achieved by selecting groups based on screening tests were surprisingly small, leading to a 7% improvement in accuracy. Statistically aggregating decisions across individuals-using a 'wisdom of crowds' approach-led to more substantial gains than selection alone. Finally, controlling for individual accuracy of team members, the performance of a team in one test predicted their performance in a subsequent test, suggesting that a 'good team' is not only defined by the individual accuracy of team members. Overall, these results underline the need to use a combination of approaches to improve face identification performance in professional settings.

Keywords: Face recognition; Identity verification; Individual differences; Personnel selection; Super-recognizers; Unfamiliar face matching.

PubMed Disclaimer

Conflict of interest statement

This study was approved by the Human Research Ethics Committee at UNSW Sydney. All participants written informed consent.All people pictured in Fig. 1 provided written informed consent and appropriate photographic release.The authors declare that they have no competing interests.Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
An illustration of the trial design in the Face Recognition Candidate List Review Task (FR-Task). This task simulated the workflow of passport officers using face recognition software to screen for identity fraud in passport applications. On each trial, participants were asked to review four faces that have been flagged by the software as potential matches to the ‘applicant’. Participants made sequential pairwise similarity ratings by comparing the four faces to the applicant (left), before reviewing and confirming these ratings in a gallery review screen (right). The images shown are representative of the images presented to participants, but for privacy reasons we are unable to reproduce the passport images used in the study
Fig. 2
Fig. 2
Accuracy distributions for face identification tests. Means are shown in the solid black lines and medians in the red dashed lines. The width of each ‘violin’ represents the expected probability density of that score based on the data and assuming a Gaussian distribution. Individual data points are shown as grey dots. See main text for details. CFMT Cambridge face memory test, FR-Task Face Recognition Candidate List Review Task, GFMT Glasgow face matching test
Fig. 3
Fig. 3
Results of Analysis 2, showing session-two Face Recognition Candidate List Review Task (FR-Task) performance in groups selected using session one screening tests. Continuous black lines show the average of all participants (n = 114). Violin plots are presented in three columns pertaining to three selection thresholds (above mean, above one standard deviation (SD), top 10 performers), and show session-two FR Test performance for groups performing above the threshold in each screening measure. Within each violin, black lines represent group mean, red lines are the median, and data points are individual participants. Because only one participant was in the Top 10 performers for face identification accuracy on all screening tests, an orange point shows their performance. CFMT Cambridge face memory test, GFMT Glasgow face matching test, PI-20 20-item prosopagnosia index, ROC receiver operating characteristic
Fig. 4
Fig. 4
Results of Analysis 3 showing average team performance on FR-Task in session two as a function of the number of individual responses aggregated. The solid line shows teams selected from the whole pool of participants and the dashed line shows teams selected from individuals achieving an average score of above one standard deviation on the session-one face tests. For comparison, crowd size of 1 is included, representing average performance of individuals. Details are provided in the main text. AUC area under the curve
Fig. 5
Fig. 5
Correlation between FR-Task accuracy of teams in sessions one and two (Analysis 4). Individual data points represent the accuracy achieved by averaging the responses of teams containing three individuals. See text for details
Fig. 6
Fig. 6
Regression analysis examining the factors underpinning stability in team accuracy across sessions one and two (Analysis 4). a Scatterplots showing team performance resulting from response aggregation (y axis) as a function of average accuracy of individual team members (x axis). Session one data are on the left scatterplot and session two on the right. b Scatterplot of residuals from the expected team accuracy based on linear regression in (a) for session one as a function of session two. Details of this analysis are provided in the main text

References

    1. Bindemann M, Avetisyan M, Rakow T. Who can recognize unfamiliar faces? Individual differences and observer consistency in person identification. Journal of Experimental Psychology: Applied. 2012;18(3):277–291. - PubMed
    1. Bobak AK, Bennetts RJ, Parris BA, Jansari A, Bate S. An in-depth cognitive examination of individuals with superior face recognition skills. Cortex. 2016;82:48–62. doi: 10.1016/j.cortex.2016.05.003. - DOI - PubMed
    1. Bobak AK, Dowsett AJ, Bate S. Solving the border control problem: evidence of enhanced face matching in individuals with extraordinary face recognition skills. PLoS One. 2016;11(2):e0148148. doi: 10.1371/journal.pone.0148148. - DOI - PMC - PubMed
    1. Bruce V, Henderson Z, Newman C, Burton AM. Matching identities of familiar and unfamiliar faces caught on CCTV images. Journal of Experimental Psychology: Applied. 2001;7(3):207–218. - PubMed
    1. Burton AM, White D, McNeill A. The Glasgow face matching test. Behavior Research Methods. 2010;42(1):286–291. doi: 10.3758/BRM.42.1.286. - DOI - PubMed

LinkOut - more resources