Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022;27(3):3891-3933.
doi: 10.1007/s10639-021-10751-5. Epub 2021 Oct 11.

Towards teaching analytics: a contextual model for analysis of students' evaluation of teaching through text mining and machine learning classification

Affiliations

Towards teaching analytics: a contextual model for analysis of students' evaluation of teaching through text mining and machine learning classification

Kingsley Okoye et al. Educ Inf Technol (Dordr). 2022.

Abstract

Recent trends in educational technology have led to emergence of methods such as teaching analytics (TA) in understanding and management of the teaching-learning processes. Didactically, teaching analytics is one of the promising and emerging methods within the Education domain that have proved to be useful, towards scholastic ways to make use of substantial pieces of evidence drawn from educational data to improve the teaching-learning processes and quality of performance. For this purpose, this study proposed an educational process and data mining plus machine learning (EPDM + ML) model applied to contextually analyze the teachers' performances and recommendations based on data derived from students' evaluation of teaching (SET). The EPDM + ML model was designed and implemented based on amalgamation of the Text mining and Machine learning technologies that builds on the descriptive decision theory, which studies the rationality behind decisions the learners are disposed to make based on the textual data quantification and statistical analysis. To this effect, the study determines pedagogical factors that influences the students' recommendations for their teachers, what role the sentiment and emotions expressed by the students in the SET play in the way they evaluate the teachers by taking into account the gender of the teachers. This includes how to automatically predict what a student's recommendation for the teachers may be based on information about the students' gender, average sentiment, and emotional valence they have shown in the SET. Practically, we applied the Text mining technique to extract the different sentiments and emotions (intensities of the comments) expressed by the students in the SET, and then utilized the quantified data (average sentiment and emotional valence) to conduct an analysis of covariance and Kruskal Wallis Test to determine the influential factors, as well as, how the students' recommendation for the teachers differ by considering the gender constructs, respectively. While a large proportion of the comments that we analyzed (n = 85,378) was classified to be neutral and predominantly interpreted to be positive in nature considering the sentiments (76.4%), and emotional valence (88.2%) expressed by the students. The results of our analysis shows that for the students' comments which contain some kind of positive or negative sentiment (23.6%) and emotional valence (11.8%); that females students recommended the teachers taking into account the sentiments (p = .000). While the males appear to be slightly borderline in terms of emotions (p = .056) and sentiment (p = .077). Also, the EPDM + ML model showed to be a good predictor and efficient method in determining what the students' recommendation scores for the teachers would be, going by the high and acceptable values of the precision (1.00), recall (1.00), specificity (1.00), accuracy (1.00), F1-score (1.00) and zero error-rate (0.00) which we validated using the k-fold cross-validation method, with 63.6% of optimal k-values observed. In theory, we note that not only does the proposed method (EPDM + ML) proves to be useful towards effective analysis of SET and its implications within the educational domain. But can be utilized to determine prominent factors that influences the students' evaluation and recommendation of the teachers, as well as helps provide solutions to the ever-increasingly need to advance and support the teaching-learning processes and/or students' learning experiences in a rapidly changing educational environment or ecosystem.

Keywords: Educational innovation; Higher education; Machine learning; Performance assessment; Teaching analytics; Text mining.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestsThe authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Educational process and data mining plus machine learning (EPDM + ML) model
Fig. 2
Fig. 2
Comments (counts) vs. Ave_Sentiment score broken down by students gender
Fig. 3
Fig. 3
Emotional valence scores for the SET comments broken down by students gender
Fig. 4
Fig. 4
Overall Emotions expressed by the students for the teachers broken down by gender of the students
Fig. 5
Fig. 5
Emotions expressed by the male students for the male vs female teachers
Fig. 6
Fig. 6
Emotions expressed by the female students for the male vs female teachers

References

    1. Abu Alfeilat HA, Hassanat ABA, Lasassmeh O, Tarawneh AS, Alhasanat MB, Eyal Salman HS, Prasath VBS. Effects of distance measure choice on k-nearest neighbor classifier performance: A review. Big Data. 2019;7(4):221–248. doi: 10.1089/big.2018.0175. - DOI - PubMed
    1. Abu Zohair LM. Prediction of Student’s performance by modelling small dataset size. International Journal of Educational Technology in Higher Education. 2019 doi: 10.1186/s41239-019-0160-3. - DOI
    1. Al-Maskari A, Al-Riyami T, Kunjumuhammed SK. Students academic and social concerns during COVID-19 pandemic. Education and Information Technologies. 2021 doi: 10.1007/s10639-021-10592-2. - DOI - PMC - PubMed
    1. Alao VM, Lansangan JRG, Barrios EB. Estimation of semiparametric mixed analysis of covariance model. Communications in Statistics Simulation and Computation. 2019 doi: 10.1080/03610918.2019.1694152. - DOI
    1. Aldowah H, Al-Samarraie H, Fauzy WM. Educational data mining and learning analytics for 21st century higher education: A review and synthesis. Telematics and Informatics. 2019;37(April 2018):13–49. doi: 10.1016/j.tele.2019.01.007. - DOI

LinkOut - more resources