/Machine-Learning-Lab-Plaksha

Programming assignments covering fundamentals of machine learning and deep learning. These were completed as part of the Plaksha Tech Leaders Fellowship program.

Primary LanguageJupyter Notebook

Plaksha Machine Learning Lab Assignments

  • This repository contains programming assignments from the core subject Machine Learning, that were completed as part of the Plaksha Tech Leaders Fellowship program.

Table of Contents

  1. Instructions to use the repository
  2. List of Dependencies
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Instructions to use the repository

  1. Clone the repository to use it. https://github.com/Ankit-Kumar-Saini/Machine-Learning-Lab-Plaksha

List of Dependencies

The requirements.txt file list all the libraries/dependencies required to run the notebooks/projects in this repository.

File Descriptions

  1. Machine Learning: This folder contains machine learning assignments in the form of jupyter notebooks and datasets that were used to train the models. The details of the files/folders present inside this folder are as follows:

    1. Data Sets: This folder contains all the datasets that were used to complete the programming assignments.

    2. Jupyter Notebooks: The following jupyter notebooks are present in this folder

      • Lab1 Taylor Series & Contour Plots.ipynb: Code to approximate a given function at given points using Taylor Series expansion using numpy library.

      • Lab2 Loss Functions.ipynb: Implementation of different loss functions in numpy.

      • Lab3 Linear Models.ipynb: LinearRegression model training on random data using scikit-learn library.

      • Lab4 Logistic Regression.ipynb: LogisticRegression model to classify housing data into low and high price using scikit-learn library.

      • Lab5 Random Forest Model and MLP.ipynb: RandomForestClassifier and Multi Layer Perceptron (MLP) to classify MNIST digits using scikit-learn library.

      • Lab6 Ensemble Methods.ipynb: Training of Bagging and Voting classifiers using scikit-learn library.

      • Server Load Prediction.ipynb: Prediction of server load using RandomForestClassifier model.

  2. Deep Learning: This folder contains deep learning assignments in form of jupyter notebooks. All the notebooks are not part of the Plaksha curriculum. Few notebooks resulted from my personal interest to go deeper into the deep learning domain. The following jupyter notebooks are present in this folder

    • MNIST digit classification.ipynb: Building and training of CNN model from scratch to classify MNIST digits into 10 classes (0-9). Use of image data augmentation to avoid overfitting during training.

    • Transfer Learning.ipynb: Use of transfer learning to classify images into 11 categories by using the pre-trained ResNet50 model in TensorFlow.

    • VAE MNIST.ipynb: Training of CNN-based variational autoencoder on the MNIST digits dataset to generate new data in TensorFlow.

    • Cats_Dogs_Breed_Classification.ipynb: Use of transfer learning to classify images of cats and dog breeds into 37 categories using the pre-trained ResNet50 model with an F1-score of 99.3% in TensorFlow.

    • DCGAN MNIST.ipynb: Implementation of DCGAN to generate MNIST digits by training on the MNIST digits dataset in TensorFlow.

    • DCGAN CelebA.ipynb: Implementation of DCGAN to generate synthetic faces by training on the CelebA dataset in TensorFlow. Adding smiles on the generated faces by manipulating the generated images.

    • Fine Tune GPT2.ipynb: Text generation by fine-tuning GPT-2 model on the billsum dataset.

    • Molecule Generation LSTM.ipynb: Reconstruction of molecules using LSTM-based autoencoder by training on the SMILES dataset in TensorFlow.

  3. sample_images: This folder contains images of results taken from the jupyter notebooks in the Deep Learning section.

  4. README.md file: This file contains documentation of the repository.

Results

This section displays few results from the jupyter notebooks in form of images.

  1. Face images generated by the DCGAN during training on CelebA dataset

alt text

  1. Adding smiles to the generated face images

alt text

  1. MNIST digits generated by the DCGAN during training

alt text

  1. Text generation by fine-tuning GPT-2 on billsum dataset

alt text

  1. Molecules reconstruction using LSTM autoencoders (SMILES dataset)

alt text

  1. MNIST digits generation using CNN based Variational Autoencoder

alt text

  1. Visualization of latent space of Variational Autoencoder after training on MNIST digits dataset

alt text

  1. 2D Gaussian Distribution of latent space learned by the VAE for digit 0

alt text

  1. Classification results of cats and dogs breed on the test dataset

alt text

Licensing, Authors, Acknowledgements

Must give credit to Plaksha for the data and python 3 notebook.