prrathi's Stars
anshgs/earthgen
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
koekeishiya/yabai
A tiling window manager for macOS based on binary space partitioning
HigherOrderCO/HVM
A massively parallel, optimal functional runtime in Rust
google-deepmind/xmanager
A platform for managing machine learning experiments
meta-llama/llama3
The official Meta Llama 3 GitHub site
Schwidola0607/awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
kingoflolz/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
google/maxtext
A simple, performant and scalable Jax LLM!
stas00/ml-engineering
Machine Learning Engineering Open Book
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
colesbury/nogil
Multithreaded Python without the GIL
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
openxla/stablehlo
Backward compatible ML compute opset inspired by HLO/MHLO
kuangliu/pytorch-cifar
95.47% on CIFAR10 with PyTorch
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
justinpinkney/stable-diffusion
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
RUCAIBox/RecBole
A unified, comprehensive and efficient recommendation library
openxla/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
chroma-core/chroma
the AI-native open-source embedding database
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Pythagora-io/gpt-pilot
The first real AI developer
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
remzi-arpacidusseau/ostep-projects
Projects for an undergraduate OS course
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
triton-lang/triton
Development repository for the Triton language and compiler