Scene Classification

Classifies images into two labels: indoor and outdoor.
A pretrained ResNet is used as the initial model.
For training data, videos were collected from the YouTube-8M dataset based on the entity labels provided.
The videos are downloaded using pytube and the frames are extracted using OpenCV.
Frames are labelled using the entity labels of their source videos; some frames were manually removed for lack of relevance (e.g. video title frames). The total dataset has ~2010 images.
The train/validation/test split is done using a custom dataset class in PyTorch, with 200 images in the validation set and 200 in the test set.
The model is trained for 24 epochs; the training loss, validation loss and validation set accuracy were plotted in real time using tensorboardX.
The test accuracy (which can be calculated by running the command "python run_model.py") is 99.5%.
A few test cases are added to check model sanity; see model_tests.py.
The final model is saved in the checkpoints folder.
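
The split sizes and the accuracy figure above can be illustrated with a small standard-library sketch. This is not the repository's custom dataset class: `split_indices` and `accuracy` are hypothetical helpers, the seed is arbitrary, and 2010 stands in for the approximate dataset size. The resulting index lists could, for instance, be used to select samples from a PyTorch dataset.

```python
import random


def split_indices(n_total, n_val=200, n_test=200, seed=0):
    """Shuffle indices 0..n_total-1 and cut them into train/val/test lists.

    Hypothetical helper: the sizes follow the README (200 validation,
    200 test), the seed is an arbitrary choice for reproducibility.
    """
    if n_val + n_test >= n_total:
        raise ValueError("validation and test sets must leave training data")
    indices = list(range(n_total))
    random.Random(seed).shuffle(indices)  # seeded, so the split is repeatable
    test = indices[:n_test]
    val = indices[n_test:n_test + n_val]
    train = indices[n_test + n_val:]
    return train, val, test


def accuracy(predictions, targets):
    """Return the fraction of predictions that equal their target label."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(p == t for p, t in zip(predictions, targets))
    return correct / len(targets)


if __name__ == "__main__":
    # 2010 is an assumed exact size; the README only says "~2010".
    train_idx, val_idx, test_idx = split_indices(2010)
    print(len(train_idx), len(val_idx), len(test_idx))

    # Illustrative labels only: one mistake among 200 test images.
    targets = ["indoor"] * 100 + ["outdoor"] * 100
    predictions = list(targets)
    predictions[0] = "outdoor"
    print(f"{accuracy(predictions, targets):.1%}")
```

If the test set holds exactly 200 images, an accuracy of 99.5% corresponds to 199 correct predictions and a single misclassification.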

To predict the label of a new image using the model, run "python run_model.py -i {path_to_image}"
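
The two invocations of run_model.py (with and without -i) suggest a command-line interface along the following lines. This is only a guess at the structure, not the actual script: `build_parser` and `choose_mode` are hypothetical names, and the real script may parse its arguments differently.

```python
import argparse


def build_parser():
    """Build a parser with the single -i option mentioned in the README."""
    parser = argparse.ArgumentParser(
        description="Indoor/outdoor scene classifier (illustrative sketch)"
    )
    parser.add_argument(
        "-i",
        dest="image",
        default=None,
        help="path to an image to classify; omit to evaluate on the test set",
    )
    return parser


def choose_mode(argv):
    """Map command-line arguments to an (action, image_path) pair.

    Assumed behaviour: no -i means "evaluate on the test set",
    -i <path> means "predict the label of that image".
    """
    args = build_parser().parse_args(argv)
    if args.image is None:
        return ("evaluate", None)
    return ("predict", args.image)


if __name__ == "__main__":
    import sys

    print(choose_mode(sys.argv[1:]))
```

Under these assumptions, `choose_mode([])` selects test-set evaluation and `choose_mode(["-i", "room.jpg"])` selects single-image prediction; the model loading and inference steps are left out because the README does not describe them.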

chaitanyacsss/SceneClassification