divyanshk's Stars
pytorch/data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
ServiceNow/Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
facebookexperimental/triton
Github mirror of trition-lang/triton repo.
Jokeren/Awesome-GPU
Awesome resources for GPUs
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
pytorch-labs/tritonbench
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
LambdaLabsML/distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
gpu-mode/resource-stream
GPU programming related news and material links
karpathy/llama2.c
Inference Llama 2 in one file of pure C
meta-llama/llama-stack
Composable building blocks to build Llama Apps
gpu-mode/lectures
Material for gpu-mode lectures
divyanshk/CodePlayList
a music playlist for ComputerGeeks :p
gpu-mode/awesomeMLSys
An ML Systems Onboarding list
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
rasbt/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used
karpathy/llm.c
LLM training in simple, raw C/CUDA
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
facebookresearch/generative-recommenders
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
pytorch/torchtune
PyTorch native post-training library
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
refuel-ai/autolabel
Label, clean and enrich text datasets with LLMs.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.