Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 28;120(9):e2204781120.
doi: 10.1073/pnas.2204781120. Epub 2023 Feb 24.

Reimagining the machine learning life cycle to improve educational outcomes of students

Affiliations

Reimagining the machine learning life cycle to improve educational outcomes of students

Lydia T Liu et al. Proc Natl Acad Sci U S A. .

Abstract

Machine learning (ML) techniques are increasingly prevalent in education, from their use in predicting student dropout to assisting in university admissions and facilitating the rise of massive open online courses (MOOCs). Given the rapid growth of these novel uses, there is a pressing need to investigate how ML techniques support long-standing education principles and goals. In this work, we shed light on this complex landscape drawing on qualitative insights from interviews with education experts. These interviews comprise in-depth evaluations of ML for education (ML4Ed) papers published in preeminent applied ML conferences over the past decade. Our central research goal is to critically examine how the stated or implied education and societal objectives of these papers are aligned with the ML problems they tackle. That is, to what extent does the technical problem formulation, objectives, approach, and interpretation of results align with the education problem at hand? We find that a cross-disciplinary gap exists and is particularly salient in two parts of the ML life cycle: the formulation of an ML problem from education goals and the translation of predictions to interventions. We use these insights to propose an extended ML life cycle, which may also apply to the use of ML in other domains. Our work joins a growing number of meta-analytical studies across education and ML research as well as critical analyses of the societal impact of ML. Specifically, it fills a gap between the prevailing technical understanding of machine learning and the perspective of education researchers working with students and in policy.

Keywords: algorithmic fairness; education interventions; education technologies; machine learning for social good; problem formulation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
An extended ML life cycle diagram. The inner “ML Problem Box” represents the typical aspects of the ML problem detailed in the surveyed ML research papers. Our interview findings reveal the need to consider an extended version of the ML life cycle in ML research, including the initial problem formulation stage by practitioners and researchers and the translation from predictions to interventions that eventually impact stakeholders.

Similar articles

Cited by

References

    1. Bennett J., Lanning S., et al. , “The Netflix prize” in Proceedings of KDD Cup and Workshop (New York, NY, 2007), vol. 2007, p. 35.
    1. Jelinek F., Statistical Methods for Speech Recognition (MIT Press, 1997).
    1. Bottou L., et al. , “Comparison of classifier methods: A case study in handwritten digit recognition” in Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3-Conference C: Signal Processing (Cat. No. 94CH3440-5) (IEEE, 1994), vol. 2, pp. 77–82.
    1. Shavlik J. W., Dietterich T., Dietterich T. G., Readings in Machine Learning (Morgan Kaufmann, 1990).
    1. Bird K. A., Castleman B. L., Mabel Z., Song Y., Bringing transparency to predictive analytics: A systematic comparison of predictive modeling methods in higher education. AERA Open 7, 23328584211037630 (2021).