Pinned Repositories
nanotron
Minimalistic large language model 3D-parallelism training
ai-notebooks
AI notebooks
fastgoose
A PyTorch implementation of Model Parallelism and ZeRO Optimizer
instructGOOSE
Implementation of Reinforcement Learning from Human Feedback (RLHF)
pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
prodgpt
A production-ready training, evaluation and data pipeline
progen
Generating new proteins using language models
reinforcement-learning
stable-diffusion-from-scratch
Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]
toolformer
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools
xrsrke's Repositories
xrsrke/instructGOOSE
Implementation of Reinforcement Learning from Human Feedback (RLHF)
xrsrke/toolformer
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools
xrsrke/pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
xrsrke/prodgpt
A production-ready training, evaluation and data pipeline
xrsrke/fastgoose
A PyTorch implementation of Model Parallelism and ZeRO Optimizer
xrsrke/homework
Notes
xrsrke/nanoGPT-msamp
Integrate MS-AMP into nanoGPT (https://github.com/karpathy/nanoGPT)
xrsrke/snippets
snippets
xrsrke/elasticgoose
A fault-tolerant elastic training framework for PyTorch
xrsrke/fsdl-megatron
Code for FSDL Breaking down parallelism in Megatron-LM
xrsrke/fsdl-website
Source for https://fullstackdeeplearning.com
xrsrke/hf-notebooks
xrsrke/Jetfire-INT8Training
xrsrke/megatron-tp
for debugging pipegoose
xrsrke/minitron
A mini Megatron 3D parallelism library for FSDL blog
xrsrke/mousai
PyTorch Implementation of Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion
xrsrke/pipegoose-training
xrsrke/transformers-starcoder
xrsrke/xrsrke
xrsrke/hf-blog
Public repo for HF blog posts
xrsrke/internal
Mechanistic Interpretability's Tools
xrsrke/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
xrsrke/nmmo-baselines
Baselines for Neural MMO -- new users should treat this repo as a starter project
xrsrke/nmmo-environment
Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
xrsrke/perceiver
Implementation of Perceiver: General Perception with Iterative Attention from DeepMind
xrsrke/prodgpt-data
Data Versioning for ProdGPT
xrsrke/prodgpt-dbt
xrsrke/vision-transformer
Pytorch implementation of An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
xrsrke/xrsrke.github.io
xrsrke/xrswtf
xrs.wtf