Pinned Repositories
accelagent.github.io
facmac
leapfrog-triejoin
High-performance (C++) implementation of the leapfrog-triejoin algorithm by Todd Veldhuizen (http://arxiv.org/abs/1210.0481)
mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
meme
This repository contains the code for the experiments in the paper "Communicating via Markov Decision Processes", accepted at ICML 2022, by S. Sokota*, C. Schroeder de Witt*, et al.
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's MuJoCo Gym environments.
perfectly-secure-steganography
Contains open-source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"
rl_games
TensorFlow RL implementations
schroederdewitt's Repositories
schroederdewitt/multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's MuJoCo Gym environments.
schroederdewitt/perfectly-secure-steganography
Contains open-source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"
schroederdewitt/facmac
schroederdewitt/meme
This repository contains the code for the experiments in the paper "Communicating via Markov Decision Processes", accepted at ICML 2022, by S. Sokota*, C. Schroeder de Witt*, et al.
schroederdewitt/accelagent.github.io
schroederdewitt/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
schroederdewitt/annotated-s4
Implementation of https://srush.github.io/annotated-s4
schroederdewitt/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
schroederdewitt/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
schroederdewitt/dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
schroederdewitt/docker-postgis
Docker image for PostGIS
schroederdewitt/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
schroederdewitt/generativeAgent_LLM
Implementation of the "Generative Agents: Interactive Simulacra of Human Behavior" paper with Guidance and Langchain. Full-featured and works with local LLMs.
schroederdewitt/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
schroederdewitt/gymnax
RL Environments in JAX 🌍
schroederdewitt/gymnax-blines
Baselines for gymnax 🤖
schroederdewitt/icor-codon-optimization
RNN-based Codon Optimization Tool. Preprint paper: https://doi.org/10.1101/2021.11.08.467706
schroederdewitt/openmalaria-gpu
A (partial) reimplementation of OpenMalaria on GPU using PyTorch
schroederdewitt/pomdp-py
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
schroederdewitt/purejaxrl
Really Fast End-to-End JAX RL Implementations
schroederdewitt/pymdptoolbox
Markov Decision Process (MDP) Toolbox for Python
schroederdewitt/pytorch-forecasting
Time series forecasting with PyTorch
schroederdewitt/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
schroederdewitt/S5
schroederdewitt/SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
schroederdewitt/spinningup
An educational resource to help anyone learn deep reinforcement learning.
schroederdewitt/TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
schroederdewitt/torrvision.github.io
schroederdewitt/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
schroederdewitt/varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)