Reimagining the machine learning life cycle to improve educational outcomes of students
- PMID: 36827260
- PMCID: PMC9992853
- DOI: 10.1073/pnas.2204781120
Reimagining the machine learning life cycle to improve educational outcomes of students
Abstract
Machine learning (ML) techniques are increasingly prevalent in education, from their use in predicting student dropout to assisting in university admissions and facilitating the rise of massive open online courses (MOOCs). Given the rapid growth of these novel uses, there is a pressing need to investigate how ML techniques support long-standing education principles and goals. In this work, we shed light on this complex landscape drawing on qualitative insights from interviews with education experts. These interviews comprise in-depth evaluations of ML for education (ML4Ed) papers published in preeminent applied ML conferences over the past decade. Our central research goal is to critically examine how the stated or implied education and societal objectives of these papers are aligned with the ML problems they tackle. That is, to what extent does the technical problem formulation, objectives, approach, and interpretation of results align with the education problem at hand? We find that a cross-disciplinary gap exists and is particularly salient in two parts of the ML life cycle: the formulation of an ML problem from education goals and the translation of predictions to interventions. We use these insights to propose an extended ML life cycle, which may also apply to the use of ML in other domains. Our work joins a growing number of meta-analytical studies across education and ML research as well as critical analyses of the societal impact of ML. Specifically, it fills a gap between the prevailing technical understanding of machine learning and the perspective of education researchers working with students and in policy.
Keywords: algorithmic fairness; education interventions; education technologies; machine learning for social good; problem formulation.
Conflict of interest statement
The authors declare no competing interest.
Figures

Similar articles
-
Impact of summer programmes on the outcomes of disadvantaged or 'at risk' young people: A systematic review.Campbell Syst Rev. 2024 Jun 13;20(2):e1406. doi: 10.1002/cl2.1406. eCollection 2024 Jun. Campbell Syst Rev. 2024. PMID: 38873396 Free PMC article. Review.
-
Predicting student success in MOOCs: a comprehensive analysis using machine learning models.PeerJ Comput Sci. 2024 Aug 23;10:e2221. doi: 10.7717/peerj-cs.2221. eCollection 2024. PeerJ Comput Sci. 2024. PMID: 39678289 Free PMC article.
-
The future of Cochrane Neonatal.Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12. Early Hum Dev. 2020. PMID: 33036834
-
Recovery schools for improving behavioral and academic outcomes among students in recovery from substance use disorders: a systematic review.Campbell Syst Rev. 2018 Oct 4;14(1):1-86. doi: 10.4073/csr.2018.9. eCollection 2018. Campbell Syst Rev. 2018. PMID: 37131375 Free PMC article.
-
The effects of small class sizes on students' academic achievement, socioemotional development and well-being in special education: A systematic review.Campbell Syst Rev. 2023 Jul 14;19(3):e1345. doi: 10.1002/cl2.1345. eCollection 2023 Sep. Campbell Syst Rev. 2023. PMID: 37457897 Free PMC article. Review.
Cited by
-
Grading by AI makes me feel fairer? How different evaluators affect college students' perception of fairness.Front Psychol. 2024 Feb 2;15:1221177. doi: 10.3389/fpsyg.2024.1221177. eCollection 2024. Front Psychol. 2024. PMID: 38371704 Free PMC article.
References
-
- Bennett J., Lanning S., et al. , “The Netflix prize” in Proceedings of KDD Cup and Workshop (New York, NY, 2007), vol. 2007, p. 35.
-
- Jelinek F., Statistical Methods for Speech Recognition (MIT Press, 1997).
-
- Bottou L., et al. , “Comparison of classifier methods: A case study in handwritten digit recognition” in Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3-Conference C: Signal Processing (Cat. No. 94CH3440-5) (IEEE, 1994), vol. 2, pp. 77–82.
-
- Shavlik J. W., Dietterich T., Dietterich T. G., Readings in Machine Learning (Morgan Kaufmann, 1990).
-
- Bird K. A., Castleman B. L., Mabel Z., Song Y., Bringing transparency to predictive analytics: A systematic comparison of predictive modeling methods in higher education. AERA Open 7, 23328584211037630 (2021).
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous