/synthetic-data-generator

We will create a machine learning pipeline to generate time series and other types of datasets using GAN(Generative Adversarial Networks) and LSTM models from custom sample data.

Primary LanguageHTML

synthetic-data-generator

We will create a machine learning pipeline to generate time series and other types of datasets using GAN(Generative Adversarial Networks) and LSTM models from custom sample data.

NOTE: This repository is intended for learning and research purposes. It is also worth nothing that this is a WIP and will continue to be updated.

Experimentation Notebooks

Time Series dGAN Synthetic Data Generation

Open In Colab

LSTM Synthetic Data Generation

Open In Colab

Some examples - Comparison Reports

Pipeline (Coming Soon)

Getting started

  1. Install Anaconda

  2. Install conda environment and activate

    conda env create -f environment.yaml

    conda activate synthetic-data-generator

  3. Change into the main pipeline directory

    cd synthetic-data-pipeline

  4. Install the dependencies

    pip install -r src/requirements.txt

  5. Setup your IDE for Kedro projects

  6. Continue the next steps in this README

References