Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 6;120(23):e2301990120.
doi: 10.1073/pnas.2301990120. Epub 2023 May 30.

Cohort bias in predictive risk assessments of future criminal justice system involvement

Affiliations

Cohort bias in predictive risk assessments of future criminal justice system involvement

Erika Montana et al. Proc Natl Acad Sci U S A. .

Abstract

Risk assessment instruments (RAIs) are widely used to aid high-stakes decision-making in criminal justice settings and other areas such as health care and child welfare. These tools, whether using machine learning or simpler algorithms, typically assume a time-invariant relationship between predictors and outcome. Because societies are themselves changing and not just individuals, this assumption may be violated in many behavioral settings, generating what we call cohort bias. Analyzing criminal histories in a cohort-sequential longitudinal study of children, we demonstrate that regardless of model type or predictor sets, a tool trained to predict the likelihood of arrest between the ages of 17 and 24 y on older birth cohorts systematically overpredicts the likelihood of arrest for younger birth cohorts over the period 1995 to 2020. Cohort bias is found for both relative and absolute risks, and it persists for all racial groups and within groups at highest risk for arrest. The results imply that cohort bias is an underappreciated mechanism generating inequality in contacts with the criminal legal system that is distinct from racial bias. Cohort bias is a challenge not only for predictive instruments with respect to crime and justice, but also for RAIs more broadly.

Keywords: bias; cohort; criminal justice; risk assessment; social change.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
ROC curves showing performance of logistic regression models. Models were trained on the older cohort. Performance on the older cohort is shown in red, while performance on the younger cohort is shown in blue. Results are shown for both (A) an unregularized logistic regression using the classic risk-factor feature set and (B) a lasso logistic regression using the full feature set.
Fig. 2.
Fig. 2.
Calibration plots for models trained on older cohort. Calibration plots for the classic risk-factor unregularized logistic regression model (Top) and the full lasso logistic regression model (Bottom). Models were trained on the older cohort, and performance on the older cohort is shown on the Left, while performance on the younger cohort is shown on the Right. The regression line falls below the ideal 45° line indicating persistent overprediction of the likelihood of arrest for the younger cohort in both models.
Fig. 3.
Fig. 3.
Comparison of rankings between full lasso logistic regression models trained on different cohorts. Plot of younger cohort, risk rankings produced by the full lasso logistic regression model trained on the older cohort (x axis) and the same model trained on the younger cohort (y axis). Both the plot and the Spearman’s correlation value of 0.76 show that while the two sets of rankings are correlated, there are substantial ranking differences between the model produced by the older cohort and that produced by the younger cohort.
Fig. 4.
Fig. 4.
Calibration plots for classic risk-factor logistic regression model trained on older cohort by race. Performance on the older cohort of the classic risk-factor logistic regression model trained on the older cohort is shown in red, and performance on the younger cohort is shown in blue. Systematic overprediction of the likelihood of arrest is observed across racial groups.
Fig. 5.
Fig. 5.
Calibration plots for models including an adolescent arrest history predictor and trained on older cohort. Calibration plots for two models which include a binary variable indicating arrest at age 17 or 18 y: the classic risk-factor logistic regression model (Top) and the full lasso regularized logistic regression model (Bottom). The left column shows the performance of these models on the older cohort on which they were trained. The right column shows the performance of the same models on the younger cohort. When arrest history is included as a predictor, models still overpredict arrest likelihood for the younger cohort.
Fig. 6.
Fig. 6.
Calibration plots for models trained on older cohort applied to individuals who are two SDs below the average on self-control. The Top row shows the calibration of the classic risk-factor logistic regression model with performance for the older cohort on the Left and performance for the younger cohort on the Right. The Bottom row shows the performance of full lasso logistic regression model. The deviation of the regression line from the diagonal in the right column indicates that when considering only individuals with low self-control, cohort bias is observed.

Comment in

References

    1. Leist A. K., et al. , Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences. Sci. Adv. 8, eabk1942 (2022). - PMC - PubMed
    1. Editors, Description, prediction, explanation. Nat. Hum. Behav. 5, 1261–1261 (2021). - PubMed
    1. El Morr C., Ginsburg L., Nam V.-S., Woollard S., Hansen B., Analyzing readmissions patterns: Assessment of the LACE tool impact. Stud. Health Technol. Inf. 223, 25–30 (2016). - PubMed
    1. Chouldechova A., Benavides-Prado D., Fialko O., Vaithianathan R., A case study of algorithm-assisted decision making in child maltreatment hotline screening decisions. Proc. Machine Learn. Res. 81, 134–148 (2018).
    1. Rotaru V., Huang Y., Li T., Evans J., Chattopadhyay I., Event-level prediction of urban crime reveals a signature of enforcement bias in us cities. Nat. Hum. Behav. 6, 1056–1068 (2022). - PubMed

Publication types

LinkOut - more resources