thevasudevgupta
trying to learn what AI learns | building AGI for consumption
@Unbox-AINew Delhi, India
Pinned Repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
awesome-mlops
A curated list of references for MLOps
bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
boilerplate
NO MORE COPY/PASTING BOILERPLATE :)
ds-toolkit
Some useful stuff for a software/ML engineer
gpu-programming
GPU Programming @ IIT Madras
gsoc-wav2vec2
GSoC'2021 | TensorFlow implementation of Wav2Vec2
PaperHunt
Simple script for hunting trending papers everyday.
speech-jax
Speech in Flax/JAX
transformers-adapters
This repositary hosts my experiments for the project, I did with OffNote Labs.
thevasudevgupta's Repositories
thevasudevgupta/bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
thevasudevgupta/speech-jax
Speech in Flax/JAX
thevasudevgupta/ds-toolkit
Some useful stuff for a software/ML engineer
thevasudevgupta/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
thevasudevgupta/biobigbird
BigBird for bio-medical domain
thevasudevgupta/accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
thevasudevgupta/d2v
Data2Vec style pretraining
thevasudevgupta/data-centric-ai
Resources for Data Centric AI
thevasudevgupta/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
thevasudevgupta/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
thevasudevgupta/dsa-prep
Preparation material for getting strong grip on data structures & algorithms!!
thevasudevgupta/FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
thevasudevgupta/flash-attention
Fast and memory-efficient exact attention
thevasudevgupta/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
thevasudevgupta/fms-fsdp
Demonstrate throughput of PyTorch FSDP
thevasudevgupta/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
thevasudevgupta/grok
Grok open release
thevasudevgupta/hyperpod
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
thevasudevgupta/ImageBind
ImageBind One Embedding Space to Bind Them All
thevasudevgupta/megablocks
thevasudevgupta/ml-engineering
Machine Learning Engineering Open Book
thevasudevgupta/nanotron
Minimalistic large language model 3D-parallelism training
thevasudevgupta/OLMo
Modeling, training, eval, and inference code for OLMo
thevasudevgupta/peft
Parameter-Efficient Fine-Tuning
thevasudevgupta/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
thevasudevgupta/text-generation-inference
Large Language Model Text Generation Inference
thevasudevgupta/thevasudevgupta
thevasudevgupta/transformers-bloom-inference
Fast Inference Solutions for BLOOM
thevasudevgupta/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
thevasudevgupta/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs