This is the course project for 02-620: Machine Learning For Scientist in Spring 2022. And oour project is about cancer classification using machine learning.
- Yuxuan Wu
- Eric Li
- Yifan Wu
- Xin Wang
- Python3
- scikit-learn
- pandas
- numpy
- pytorch
All jupyter notebooks are self-contained and runnable!
We used R to perform Differential Gene Expression (DGE), which is one of our feature selection methods. And the other is selectKBest
function in Python.
R scripts are under ./Preprocessing_and_DGE
self-implemented decision tree model: ./Model/Decision_tree/decision_tree.ipynb
self-implemented adaboost model: ./Model/AdaBoost/adaboost.ipynb
./Model/MLP
./Performance
./Results/Performance
./Feature_importance
./Results/Feature_importance
./Additional_plots