mgerstgrasser

Stanford CSPalo Alto, CA

Pinned Repositories

CrowdPlay
A web based platform for collecting human actions in reinforcement learning environments
Language:Jupyter Notebook26 3 13
aArtisanAutotune
A PID Autotuning sketch for aArtisan.
Language:C++34
flashinfer
FlashInfer: Kernel Library for LLM Serving
Language:Cuda00
Multi-Agent-ALE
The Arcade Learning Environment (ALE) -- a platform for AI research.
Language:C++00
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
Language:Python00
oracles_and_followers
Code for the ICML 2023 paper "Oracles & Followers"
Language:Python51
ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language:Python0 0 00
super
suPER is a collaborative multi-agent RL algorithm
Language:Python11 2 11
tacheles
a lightweight, open-source blueprint for building powerful and scalable LLM chat applications
Language:JavaScript30 3 02
tc4plus-coffee-roaster-shield
The TC4+ is improving and expanding on the TC4 Arduino shield for DIY (and commercial) coffee roasters.
176

mgerstgrasser's Repositories

mgerstgrasser/tacheles
a lightweight, open-source blueprint for building powerful and scalable LLM chat applications
Language:JavaScript30 3 02
mgerstgrasser/tc4plus-coffee-roaster-shield
The TC4+ is improving and expanding on the TC4 Arduino shield for DIY (and commercial) coffee roasters.
176
mgerstgrasser/super
suPER is a collaborative multi-agent RL algorithm
Language:Python11 2 11
mgerstgrasser/oracles_and_followers
Code for the ICML 2023 paper "Oracles & Followers"
Language:Python51
mgerstgrasser/aArtisanAutotune
A PID Autotuning sketch for aArtisan.
Language:C++34
mgerstgrasser/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language:Cuda00
mgerstgrasser/Multi-Agent-ALE
The Arcade Learning Environment (ALE) -- a platform for AI research.
Language:C++00
mgerstgrasser/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
Language:Python00
mgerstgrasser/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language:Python0 0 00
mgerstgrasser/slurm-dashboard
Slurm Dashboard VSCode extension
Language:TypeScript0 0 00
mgerstgrasser/TC4-shield
Language:C++0 2 03
mgerstgrasser/trl
Train transformer language models with reinforcement learning.
Language:Python0 0 00
mgerstgrasser/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
Language:Python0 0
mgerstgrasser/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
mgerstgrasser/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0

mgerstgrasser

Pinned Repositories

CrowdPlay

aArtisanAutotune

flashinfer

Multi-Agent-ALE

OpenRLHF

oracles_and_followers

ray

super

tacheles

tc4plus-coffee-roaster-shield

mgerstgrasser's Repositories

mgerstgrasser/tacheles

mgerstgrasser/tc4plus-coffee-roaster-shield

mgerstgrasser/super

mgerstgrasser/oracles_and_followers

mgerstgrasser/aArtisanAutotune

mgerstgrasser/flashinfer

mgerstgrasser/Multi-Agent-ALE

mgerstgrasser/OpenRLHF

mgerstgrasser/ray

mgerstgrasser/slurm-dashboard

mgerstgrasser/TC4-shield

mgerstgrasser/trl

mgerstgrasser/sglang

mgerstgrasser/transformers

mgerstgrasser/vllm