Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 11;121(24):e2322973121.
doi: 10.1073/pnas.2322973121. Epub 2024 Jun 4.

The origins of unpredictability in life outcome prediction tasks

Affiliations

The origins of unpredictability in life outcome prediction tasks

Ian Lundberg et al. Proc Natl Acad Sci U S A. .

Abstract

Why are some life outcomes difficult to predict? We investigated this question through in-depth qualitative interviews with 40 families sampled from a multidecade longitudinal study. Our sampling and interviewing process was informed by the earlier efforts of hundreds of researchers to predict life outcomes for participants in this study. The qualitative evidence we uncovered in these interviews combined with a mathematical decomposition of prediction error led us to create a conceptual framework. Our specific evidence and our more general framework suggest that unpredictability should be expected in many life outcome prediction tasks, even in the presence of complex algorithms and large datasets. Our work provides a foundation for future empirical and theoretical work on unpredictability in human lives.

Keywords: life course; limits to prediction; machine learning; mixed methods; prediction.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement:The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Life outcome prediction task. In the task we studied, the feature observation window is from the child’s birth to age 9. The intervening period follows from age 9 to 15. The outcome time is age 15. The chosen features are 12,942 survey responses. The chosen outcome is self-reported grade point average (GPA), which ranges from 1.00 (worst) to 4.00 (best).
Fig. 2.
Fig. 2.
Origins of unpredictability: irreducible error and learning error. See also figure 1.6 in ref. . Each dot represents a person; people who share a single value on the predictor have many outcomes (each vertical set of dots). This type of error is fixed by the task definition, so we call it irreducible error. Second, the estimated prediction function (regression line) is not equal to the true conditional expectation in each subgroup defined by X. The prediction is wrong in this case because the true relationship is not linear, but it could also be wrong due to sampling variability or bias. Because this component relates to the learning procedure, we call it learning error.
Fig. 3.
Fig. 3.
Inputs to irreducible error and learning error.
Fig. 4.
Fig. 4.
Sources of irreducible error.
Fig. 5.
Fig. 5.
Generalizing to other life outcome prediction tasks.

Comment in

Similar articles

Cited by

References

    1. Van Creveld M., Seeing Into the Future: A Short History of Prediction (Reaktion Books, 2020).
    1. Reichman N. E., Teitler J. O., Garfinkel I., McLanahan S. S., Fragile Families: Sample and design. Child. Youth Serv. Rev. 23, 303–326 (2001).
    1. Salganik M. J., et al. , Measuring the predictability of life outcomes with a scientific mass collaboration. Proc. Natl. Acad. Sci. U.S.A. 117, 8398–8403 (2020). - PMC - PubMed
    1. Kleinberg J., Lakkaraju H., Leskovec J., Ludwig J., Mullainathan S., Human decisions and machine predictions. Q. J. Econ. 133, 237–293 (2018). - PMC - PubMed
    1. Chouldechova A., Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data 5, 153–163 (2017). - PubMed

LinkOut - more resources