/SeekDeeper

The minimal implementation of various popular AI models

Primary LanguageJupyter NotebookMIT LicenseMIT

SeekDeeper: Minimal Implementations of Popular AI Models

[πŸ“–δΈ­ζ–‡ReadMe]

Motivation

Official code repositories often include many engineering details, which can be overwhelming for beginners. This repository aims to implement various models with as little code as possible using PyTorch, making it easier for learners to understand and reproduce results. Additionally, most tutorials lack a complete workflow, focusing only on the model without considering data loading and training. This makes it difficult for beginners to apply their knowledge in practice.

Models

Model Paper Official or Reference Repository
Transformer Attention Is All You Need https://github.com/hyunwoongko/transformer
GPT Improving Language Understanding by Generative Pre-Training https://github.com/openai/gpt-2
GAN Generative Adversarial Networks https://github.com/goodfeli/adversarial
DCGAN Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks https://github.com/Newmu/dcgan_code
WGAN-GP Improved Training of Wasserstein GANs https://github.com/igul222/improved_wgan_training

Directory Structure

For each model, the typical directory structure is as follows:

<model name>/
β”œβ”€β”€ checkpoints/
β”œβ”€β”€ components/
β”œβ”€β”€ datasets/
β”œβ”€β”€ images/
β”œβ”€β”€ README.md
β”œβ”€β”€ data.py
β”œβ”€β”€ config.py
β”œβ”€β”€ train.ipynb
└── inference.ipynb
  • checkpoints/: Contains pre-trained model weights for direct use in inference.ipynb. Sometimes, pre-trained parameters from official repositories are loaded directly.
  • components/: Contains components necessary for model implementation.
  • datasets/: Contains datasets required for training or inference validation, which may sometimes be downloaded to this directory via code.
  • images/: Contains datasets required for training or inference validation, which may sometimes be downloaded to this directory via code.
  • README.md: Introduces the implemented task and describes the implementation details.
  • data.py: Defines Dataset, Dataloader, or data preprocessing.
  • config.py: Defines hyperparameters needed for the experiment.
  • train.ipynb: Clearly presents the process from data loading, preprocessing, to training and evaluation.
  • inference.ipynb: Loads model parameters from the checkpoints/ directory for inference.

License

This project uses the MIT License.