Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 15:209:118182.
doi: 10.1016/j.eswa.2022.118182. Epub 2022 Jul 28.

Predicting student performance using sequence classification with time-based windows

Affiliations

Predicting student performance using sequence classification with time-based windows

Galina Deeva et al. Expert Syst Appl. .

Abstract

A growing number of universities worldwide use various forms of online and blended learning as part of their academic curricula. Furthermore, the recent changes caused by the COVID-19 pandemic have led to a drastic increase in importance and ubiquity of online education. Among the major advantages of e-learning is not only improving students' learning experience and widening their educational prospects, but also an opportunity to gain insights into students' learning processes with learning analytics. This study contributes to the topic of improving and understanding e-learning processes in the following ways. First, we demonstrate that accurate predictive models can be built based on sequential patterns derived from students' behavioral data, which are able to identify underperforming students early in the course. Second, we investigate the specificity-generalizability trade-off in building such predictive models by investigating whether predictive models should be built for every course individually based on course-specific sequential patterns, or across several courses based on more general behavioral patterns. Finally, we present a methodology for capturing temporal aspects in behavioral data and analyze its influence on the predictive performance of the models. The results of our improved sequence classification technique are capable to predict student performance with high levels of accuracy, reaching 90% for course-specific models.

Keywords: Behavioral patterns; Feature engineering; Machine learning; Sequence mining; Success prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
An example of behavioral data and its transformations needed to perform a sequential data analysis.
Fig. 2
Fig. 2
An illustration of the sequence classification process with iBCM.
Fig. 3
Fig. 3
An example of the situation where the information regarding temporal aspect could be substantial for predicting student performance.
Fig. 4
Fig. 4
Top 15 most important sequential patterns found by iBCM based on mean decreased impurity of the best-performing Random Forest model built for the ENMAN303 dataset. The interpretation of the events is as follows: 0enrollment.activated, 1course.upgrade.displayed, 2ui.lms.link_clicked, 4edx.ui.lms.sequence.nextselected, 5load_video, 6page_close, 10stop_video, 11problem_check, 12problem_submitted, 13problem_graded, 15ui.lms.sequence.previous_selected, 16ui.lms.sequence.tab_selected, 19openassessment.upload_file.

References

    1. Agudo-Peregrina A.F., Iglesias-Pradas S., Conde-González M.A., Hernández-García A. Can we predict success from log data in VLEs? Classification of interactions for learning analytics and their relation with performance in VLE-supported F2F and online learning. Computers in Human Behavior. 2014;31:542–550.
    1. Celis S., Moreno L., Poblete P., Villanueva J., Weber R. Un modelo analıtico para la predicción del rendimiento académico de estudiantes de ingenierıa. Revista Ingenierıa de Sistemas Volumen XXIX. 2015
    1. Daud, A., Aljohani, N. R., Abbasi, R. A., Lytras, M. D., Abbas, F., & Alowibdi, J. S. (2017). Predicting student performance using advanced learning analytics. In Proceedings of the 26th international conference on world wide web companion (pp. 415–421).
    1. De Smedt J., Deeva G., De Weerdt J. Mining behavioral sequence constraints for classification. IEEE Transactions on Knowledge and Data Engineering. 2019;32(6):1130–1142.
    1. De Smedt J., Lacka E., Nita S., Kohls H.-H., Paton R. Session stitching using sequence fingerprinting for web page visits. Decision Support Systems. 2021;150

LinkOut - more resources