Temporal prediction of dropouts in MOOCs: reaching the low hanging fruit through stacking generalization

Xing, W, Chen, X, Stein, J and Marcinkowski, M (2016) 'Temporal prediction of dropouts in MOOCs: reaching the low hanging fruit through stacking generalization.' Computers in Human Behavior, 58. pp. 119-129. ISSN 0747-5632

Official URL: http://dx.doi.org/10.1016/j.chb.2015.12.007


Massive open online courses (MOOCs) have recently taken center stage in discussions surrounding online education, both in terms of their potential as well as their high dropout rates. The high attrition rates associated with MOOCs have often been described in terms of a scale-efficacy tradeoff. Building from the large numbers associated with MOOCs and the ability to track individual student performance, this study takes an initial step towards a mechanism for the early and accurate identification of students at risk for dropping out. Focusing on struggling students who remain active in course discussion forums and who are already more likely to finish a course, we design a temporal modeling approach, one which prioritizes the at-risk students in order of their likelihood to drop out of a course. In identifying only a small subset of at-risk students, we seek to provide systematic insight for instructors so they may better provide targeted support for those students most in need of intervention. Moreover, we proffer appending historical features to the current week of features for model building and to introduce principle component analysis in order to identify the breakpoint for turning off the features of previous weeks. This appended modeling method is shown to outperform simpler temporal models which simply sum features. To deal with the kind of data variability presented by MOOCs, this study illustrates the effectiveness of an ensemble stacking generalization approach to build more robust and accurate prediction models than the direct application of base learners.

Item Type: Article
Keywords: MOOC; dropout; prediction; algorithm; stacking; learning analytics
Subjects: L Education > L Education (General)
L Education > LB Theory and practice of education
L Education > LC Special aspects of education
Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Divisions: School of Writing, Publishing and the Humanities
Identification Number: https://doi.org/10.1016/j.chb.2015.12.007
Date Deposited: 06 Dec 2016 15:04
Last Modified: 06 Jan 2022 19:41
URI / Page ID: https://researchspace.bathspa.ac.uk/id/eprint/8562
Request a change to this item or report an issue Request a change to this item or report an issue
Update item (repository staff only) Update item (repository staff only)