proceduralia's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
OpenInterpreter/open-interpreter
A natural language interface for computers
charlax/professional-programming
A collection of learning resources for curious software engineers
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
maybe-finance/maybe
The OS for your personal finances
bloomberg/memray
Memray is a memory profiler for Python
stas00/ml-engineering
Machine Learning Engineering Open Book
TencentARC/PhotoMaker
PhotoMaker
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
sgl-project/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
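The ReAct pattern interleaves reasoning traces ("Thought") with tool calls ("Action") and their results ("Observation") until the agent can answer. A toy sketch of that loop, where a hard-coded policy and a dict-backed lookup tool stand in for a real LLM and a real search API (both are illustrative assumptions, not the repo's code):

```python
# Toy ReAct loop: Thought -> Action -> Observation, repeated until an answer is found.
# FACTS and search() are hypothetical stand-ins for a real knowledge source.

FACTS = {"capital of France": "Paris"}

def search(query):
    """Hypothetical tool: look up a fact, or report a miss."""
    return FACTS.get(query, "no result")

def react_agent(question, max_steps=3):
    trace = []
    for _ in range(max_steps):
        trace.append(("Thought", f"I should look up: {question}"))
        trace.append(("Action", f"search[{question}]"))
        observation = search(question)
        trace.append(("Observation", observation))
        if observation != "no result":
            trace.append(("Answer", observation))
            return observation, trace
    return None, trace

answer, trace = react_agent("capital of France")
```

In the real method the "Thought" and "Action" strings are generated by the language model itself, and the observation is fed back into its context before the next step.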
google/maxtext
A simple, performant and scalable JAX LLM!
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
mistralai/megablocks-public
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
ezelikman/quiet-star
Code for Quiet-STaR
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward models for RLHF.
carlosferrazza/humanoid-bench
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
facebookincubator/dynolog
Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system, such as the Linux kernel, CPU, disks, Intel PT, and GPUs. Dynolog also integrates with PyTorch and can trigger traces for distributed training applications.
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
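DiffusionDPO adapts Direct Preference Optimization to diffusion models. The core DPO objective it builds on, shown here in its generic (non-diffusion) scalar form as an illustrative sketch with made-up variable names:

```python
import math

def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Generic DPO loss: -log sigmoid(beta * margin), where the margin is how much
    more the policy prefers the chosen sample over the rejected one, relative to
    a frozen reference model."""
    margin = (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# A policy that favors the chosen sample (relative to the reference) gets a lower loss
# than one that favors the rejected sample.
low = dpo_loss(-1.0, -5.0, -3.0, -3.0)
high = dpo_loss(-5.0, -1.0, -3.0, -3.0)
```

The diffusion variant replaces the sequence log-probabilities with per-timestep denoising losses, but the preference margin structure is the same.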
YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
facebookresearch/RLCD
Reproduction of "RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment"
roeehendel/icl_task_vectors
facebookresearch/rlfh-gen-div
Code for most of the experiments in the paper "Understanding the Effects of RLHF on LLM Generalisation and Diversity"
deeplearning-wisc/args
Jiuzhouh/Uncertainty-Aware-Language-Agent
Official repo for "Towards Uncertainty-Aware Language Agent".
spikedoanz/weenygrad
Minimalist vector AD
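weenygrad operates on vectors; to show the shape of such a minimalist reverse-mode autodiff engine, here is a scalar micrograd-style sketch (illustrative only, not the repo's actual API):

```python
class Value:
    """Minimal scalar reverse-mode autodiff node (illustrative, micrograd-style)."""
    def __init__(self, data, parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents
        self._grad_fn = None  # propagates this node's grad to its parents

    def __add__(self, other):
        out = Value(self.data + other.data, (self, other))
        def grad_fn():
            self.grad += out.grad
            other.grad += out.grad
        out._grad_fn = grad_fn
        return out

    def __mul__(self, other):
        out = Value(self.data * other.data, (self, other))
        def grad_fn():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._grad_fn = grad_fn
        return out

    def backward(self):
        # Build a topological order of the graph, then apply the chain rule
        # from the output node backwards.
        topo, seen = [], set()
        def build(v):
            if v not in seen:
                seen.add(v)
                for p in v._parents:
                    build(p)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            if v._grad_fn:
                v._grad_fn()

x, y = Value(2.0), Value(3.0)
z = x * y + x  # dz/dx = y + 1 = 4, dz/dy = x = 2
z.backward()
```

A vector version like weenygrad's swaps the scalar fields for arrays and the per-op gradient rules for their broadcast equivalents, but the tape-and-topological-sort structure is the same.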