/white-paper-replications

Replicating GenAI/DL white papers in pytorch to learn and have fun

Primary LanguageJupyter NotebookMIT LicenseMIT

White Paper Replicas

Welcome to the White Paper Replications repository! This project is dedicated to replicating state-of-the-art AI research from influential white papers, primarily using PyTorch and Python.

Disclaimer: The code is written by me, so it's not perfect. It's a combination of my own insights and parts I found in different resources.

Planning on adding these in the future SAM, RoPE, FlashAttention, Whisper, UNET, DDDM, DDIM, BLOOM

Overview

In this repository, you'll find implementations of various AI models and algorithms as described in prominent research papers. The goal is to better understand them by coding and trying to make them as intuitive as possible.

Available Replicas

Here's a list of the white papers and their corresponding implementations available in this repository:

Each one contains:

  • README.md: Detailed instructions on how to use the code for that specific paper.
  • requirements.txt: Python dependencies required for that implementation.

Table of Contents

Getting Started

To get started with the code in this repository, follow these steps:

  1. Clone the Repository

    git clone https://github.com/Vicba/white-paper-replications.git
    cd white-paper-replications

Requirements

To run the code, you'll need:

  • Python

Usage

To view/use the code of a specific paper, navigate to the corresponding directory and follow the instructions in the README.md file there.

Contributing

Contributions are welcome! If you'd like to add new replicas or improve existing ones, please follow these steps:

  1. Fork the repository.
  2. Create a new branch (git checkout -b feature-branch).
  3. Make your changes.
  4. Commit your changes (git commit -am 'Add <new replica>').
  5. Push to the branch (git push origin feature-branch).
  6. Open a pull request.

Please ensure your contributions adhere to the project's coding standards and include a test if applicable.

License

This project is licensed under the MIT License. See the LICENSE file for details.