The aim of this project is to let a high school student to get basic knowledges of maching learning techniques, and try to apply them on the real world dataset.
There is a real world problem: sometimes we have handwrite numbers, like phone numbers, like credit card number. It is a tedious work to type those numbers on keyboard, to input into the computer. Now we want to build a machine learning model, which can recognize the vague handwritten numbers, and automatically change it to digital numbers.
This is a kind of issue which can be solved by machine learning techniques. We have a lot of real world problems, as long as we have the dataset, we can train a machine learning model to solve them.
- Learn the basic concepts about machine learning: dataset, model, training, predicting, and the metrics to evaluate the performace of a maching learning model.
- Learn how to train a machine learning model by Python + Scikit Learn.
- Learn the basic principle of several classical machine learning algorithms, such as SVM, KNN, Naive Bayes, Decision Tree and Random forest, etc.
- Learn how to compare the classification result by human-readable plot, by using matplotlib.
- Given the data set from https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_digits.html#sklearn.datasets.load_digits, train a machine learning model based on the testset, and do a prediction on the testset.
- Compare the results of different machine learning algorithms on this task. Which one is the most accurate? Which one is the fastest in training and which one is the fastest in predicting?
- Using matplotlib, draw above comparison into graphs/plots.
- Pick one Kaggle contest, and acheive some result at the end of the project, by using he skills learned from the above 3 steps.
- https://www.jetbrains.com/pycharm/download/#section=mac
- https://brew.sh/
- https://scikit-learn.org/stable/
- https://www.youtube.com/watch?v=KTeVOb8gaD4
- https://www.youtube.com/watch?v=q7Bo_J8x_dw&list=PLQVvvaa0QuDfefDfXb9Yf0la1fPDKluPF
- https://scikit-learn.org/stable/auto_examples/index.html#classification
- https://www.kaggle.com/