fy2015-replication

This repository contains the complete configuration files needed to replicate Fei and Yeung (2015), "Temporal Models for Predicting Student Dropout in Massive Open Online Courses," using the MOOC Replication Framework (MORF). The full results of this replication are described in Gardner, Yang, Baker, and Brooks (2018), "Enabling End-To-End Machine Learning Replicability: A Case Study in Educational Data Mining."

Guide to the contents of this repo:

docker: contains the Dockerfile and the scripts needed to build the Docker image. The image can also be pulled directly from Docker Cloud by running the following in a terminal (Docker must be installed):

docker pull themorf/morf-public:fy2015-replication

config: contains two subdirectories, holdout and cv, with configuration files to reproduce the experiment using the holdout and cross-validation architectures, respectively. Note that weeks are zero-indexed: week_0 uses the first week of features, and week_4 uses weeks one through five, following the method described in the original Fei and Yeung paper.

Executing the experiments described in this repo:

To execute one of the trials described here (where a trial is a specific model evaluated with features up to a specific week number), use the MORF API functions:

from morf.utils.submit import easy_submit
easy_submit(client_config_url="https://raw.githubusercontent.com/educational-technology-collective/fy2015-replication/master/config/holdout/week_4/svm/controller.py", email_to="your-email@example.com")
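
The controller URL above follows the layout of the config directory (experiment, then week, then model). A minimal sketch of building such URLs for other trials is shown below. The helper name controller_url is hypothetical, and the lowercase directory names lr, rnn, and lstm are assumptions by analogy with the svm path shown above; check the config directory before relying on them.

```python
from itertools import product

# Base of the raw config tree, taken from the example URL in this README.
BASE_URL = (
    "https://raw.githubusercontent.com/"
    "educational-technology-collective/fy2015-replication/master/config"
)

EXPERIMENTS = ("holdout", "cv")
WEEKS = range(5)  # weeks are zero-indexed: week_0 .. week_4
# Only "svm" appears in this README; the other directory names are assumed.
MODELS = ("lr", "rnn", "lstm", "svm")


def controller_url(experiment: str, week: int, model: str) -> str:
    """Build the controller.py URL for one trial (hypothetical helper)."""
    if experiment not in EXPERIMENTS:
        raise ValueError(f"unknown experiment: {experiment!r}")
    if week not in WEEKS:
        raise ValueError(f"week must be in 0..4, got {week!r}")
    if model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")
    return f"{BASE_URL}/{experiment}/week_{week}/{model}/controller.py"


# One URL per trial: 2 experiments x 5 weeks x 4 models.
all_urls = [controller_url(e, w, m) for e, w, m in product(EXPERIMENTS, WEEKS, MODELS)]
```

A URL built this way would be passed as the client_config_url argument of easy_submit, exactly as in the call above.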

Note that the complete extraction-training-testing pipeline may take several hours. Also note that if a job uses fork_features(), the job it forks from must be executed first.

Each trial also has a persistent Digital Object Identifier whose record links to the client.config and controller scripts. Together with the Docker image described above (which is common to all of the trials), these files fully reproduce every trial of the experiment. The table below lists the Zenodo deposition ID for each trial.

Experiment	Week	Model	Zenodo Deposition ID
holdout	0	LR	1275035
holdout	0	RNN	1275045
holdout	0	SVM	1275193
holdout	0	LSTM	1275041
holdout	1	LR	1275049
holdout	1	LSTM	1275055
holdout	1	RNN	1275059
holdout	1	SVM	1275197
holdout	2	LR	1275063
holdout	2	LSTM	1275071
holdout	2	RNN	1275074
holdout	2	SVM	1275201
holdout	3	LR	1275077
holdout	3	RNN	1275081
holdout	3	LSTM	1275083
holdout	3	SVM	1275203
holdout	4	LR	1275331
holdout	4	RNN	1275335
holdout	4	LSTM	1275339
holdout	4	SVM	1275341
cv	0	LR	1275087
cv	0	RNN	1275091
cv	0	LSTM	1275095
cv	0	SVM	1275207
cv	1	LR	1275101
cv	1	RNN	1275103
cv	1	LSTM	1275107
cv	1	SVM	1275211
cv	2	LR	1275113
cv	2	RNN	1275119
cv	2	LSTM	1275121
cv	2	SVM	1275213
cv	3	LR	1275129
cv	3	RNN	1275133
cv	3	LSTM	1275135
cv	3	SVM	1275215
cv	4	LR	1275345
cv	4	RNN	1275347
cv	4	LSTM	1275351
cv	4	SVM	1275355
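
The deposition IDs in the table can be turned into DOI strings with a small helper. This is a sketch under the assumption that each deposition ID equals the published Zenodo record ID and that the DOI uses Zenodo's usual 10.5281/zenodo.<id> form; the function names are hypothetical, and the resulting links should be checked before they are cited.

```python
# Zenodo's DOI prefix; assumed to apply to the depositions in the table above.
ZENODO_DOI_PREFIX = "10.5281/zenodo"


def zenodo_doi(deposition_id: int) -> str:
    """Return the DOI string assumed to correspond to a deposition ID."""
    if deposition_id <= 0:
        raise ValueError("deposition_id must be a positive integer")
    return f"{ZENODO_DOI_PREFIX}.{deposition_id}"


def zenodo_doi_url(deposition_id: int) -> str:
    """Return the doi.org resolver URL for a deposition ID."""
    return f"https://doi.org/{zenodo_doi(deposition_id)}"


# Example: the holdout, week 4, SVM trial from the table.
example_doi = zenodo_doi(1275341)
example_url = zenodo_doi_url(1275341)
```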
