build.py creates a linear regression model that predicts grades of students based on the numerical characteristics from the student-mat.csv dataset. build.py then saves this model to topModel.pickle .
python 3.7, numpy, pandas, sklearn, and matplotlib.
https://archive.ics.uci.edu/ml/datasets/Student+Performance
Please include this citation if you plan to use this database:
P. Cortez and A. Silva. Using Data Mining to Predict Secondary School Student Performance. In A. Brito and J. Teixeira Eds., Proceedings of 5th FUture BUsiness TEChnology Conference (FUBUTEC 2008) pp. 5-12, Porto, Portugal, April, 2008, EUROSIS, ISBN 978-9077381-39-7. Paper Web Link: http://www3.dsi.uminho.pt/pcortez/student.pdf