Pinned Repositories
awesome-mlops
A curated list of references for MLOps
bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
biobigbird
BigBird for bio-medical domain
boilerplate
NO MORE COPY/PASTING BOILERPLATE :)
ds-toolkit
Some useful stuff for a software/ML engineer
gpt-triton
Triton implementation of GPT/LLAMA
gsoc-wav2vec2
GSoC'2021 | TensorFlow implementation of Wav2Vec2
PaperHunt
Simple script for hunting trending papers every day.
speech-jax
Speech in Flax/JAX
transformers-adapters
This repository hosts my experiments for the project I did with OffNote Labs.
thevasudevgupta's Repositories
thevasudevgupta/bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
thevasudevgupta/gpt-triton
Triton implementation of GPT/LLAMA
thevasudevgupta/ds-toolkit
Some useful stuff for a software/ML engineer
thevasudevgupta/biobigbird
BigBird for bio-medical domain
thevasudevgupta/accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
thevasudevgupta/cluster-health
thevasudevgupta/data-centric-ai
Resources for Data Centric AI
thevasudevgupta/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
thevasudevgupta/dsa-prep
Preparation material for getting a strong grip on data structures & algorithms!!
thevasudevgupta/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
thevasudevgupta/flash-attention
Fast and memory-efficient exact attention
thevasudevgupta/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
thevasudevgupta/fms-fsdp
Demonstrate throughput of PyTorch FSDP
thevasudevgupta/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
thevasudevgupta/gpt-llama.cpp
A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI.
thevasudevgupta/grok
Grok open release
thevasudevgupta/hyperpod
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
thevasudevgupta/ImageBind
ImageBind: One Embedding Space to Bind Them All
thevasudevgupta/megablocks
thevasudevgupta/ml-engineering
Machine Learning Engineering Open Book
thevasudevgupta/nanotron
Minimalistic large language model 3D-parallelism training
thevasudevgupta/OLMo
Modeling, training, eval, and inference code for OLMo
thevasudevgupta/peft
Parameter-Efficient Fine-Tuning
thevasudevgupta/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
thevasudevgupta/text-generation-inference
Large Language Model Text Generation Inference
thevasudevgupta/thevasudevgupta
thevasudevgupta/thevasudevgupta.github.io
personal webpage (PUBLIC)
thevasudevgupta/torchtitan
A native PyTorch Library for large model training
thevasudevgupta/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
thevasudevgupta/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs