install all the environment based on the environment.yml
file using conda or others
conda env create -f environment.yml -p ./env
- obtain your kaggle api from kaggle website.
- open
eda.ipynb
run all commands and enter your kaggle's username and password.
- Clone the
sample.ipynb
with your name<name>.ipynb
. - Code review, the first section is data preprocessing, involves
- Resizing
- Normalization [0, 255] -> [0, 1]
- Data Augmentation i. horizontal flip ii. rotation
- test all functions is working, run all the entire file
- if no error, remember search 'REMARK' and comment out the testing part which is reducing the dataloader for testing purpose
### REMARK: for testing purposes, reduce the dataset to 10% of the original size
train_dataset.dataset.samples = train_dataset.dataset.samples[:int(len(train_dataset)*0.1)]
test_dataset.dataset.samples = test_dataset.dataset.samples[:int(len(test_dataset)*0.1)]
valid_dataset.dataset.samples = valid_dataset.dataset.samples[:int(len(valid_dataset)*0.1)]
len(train_dataset), len(test_dataset), len(valid_dataset)
- remember save every trained model and history, screenshot all the training and evaluation as well.
- remember rename each trained model version to avoid overwritting.
- run
tensorboard --logdir=runs
in terminal to view the accuracy and loss result.