schroederdewitt

Pinned Repositories

accelagent.github.io
Language:JavaScript0 0 00
facmac
Language:Python1 0 00
leapfrog-triejoin
High-performance (C++) implementation of the leapfrog-triejoin algorithm by Todd Veldhuizen (http://arxiv.org/abs/1210.0481)
Language:C++17 2 04
mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
Language:Python33 2 111
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python1 3 00
meme
This repository contains the code for the experiments in the paper "Communicating via Markov Decision Processes", accepted at ICML2022, S. Sokota*, C. Schroeder de Witt*, et al.
Language:Python1 3 50
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python11 3 02
multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
Language:Python338 8 2134
perfectly-secure-steganography
Contains open source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"
Language:Python48 4 28
rl_games
TensorFlow RL implementations
Language:Python2 2 01

schroederdewitt's Repositories

schroederdewitt/multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
Language:Python338 8 2134
schroederdewitt/perfectly-secure-steganography
Contains open source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"
Language:Python48 4 28
schroederdewitt/facmac
Language:Python1 0 00
schroederdewitt/meme
This repository contains the code for the experiments in the paper "Communicating via Markov Decision Processes", accepted at ICML2022, S. Sokota*, C. Schroeder de Witt*, et al.
Language:Python1 3 50
schroederdewitt/accelagent.github.io
Language:JavaScript0 0 00
schroederdewitt/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
Language:JavaScript0 0
schroederdewitt/annotated-s4
Implementation of https://srush.github.io/annotated-s4
Language:Python
schroederdewitt/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Language:Python0 0
schroederdewitt/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python0 0
schroederdewitt/dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
Language:Python0 0
schroederdewitt/docker-postgis
Docker image for PostGIS
Language:Dockerfile0 0
schroederdewitt/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language:Python0 0
schroederdewitt/generativeAgent_LLM
Implementation of "Generative Agents: Interactive Simulacra of Human Behavior" paper with Guidance and Langchain. Full features and work with local LLMs.
Language:Jupyter Notebook0 0
schroederdewitt/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Language:Python1 0
schroederdewitt/gymnax
RL Environments in JAX 🌍
Language:Python0 0
schroederdewitt/gymnax-blines
Baselines for gymnax 🤖
Language:Jupyter Notebook0 0
schroederdewitt/icor-codon-optimization
RNN-based Codon Optimization Tool. Preprint paper: https://doi.org/10.1101/2021.11.08.467706
Language:Python0 0
schroederdewitt/openmalaria-gpu
A (partial) reimplementation of OpenMalaria on GPU using PyTorch
Language:Python1 0
schroederdewitt/pomdp-py
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
Language:Python0 0
schroederdewitt/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python0 0
schroederdewitt/pymdptoolbox
Markov Decision Process (MDP) Toolbox for Python
Language:Python0 0
schroederdewitt/pytorch-forecasting
Time series forecasting with PyTorch
Language:Python0 0
schroederdewitt/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
Language:Jupyter Notebook0 0
schroederdewitt/S5
Language:Python0 0
schroederdewitt/SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
Language:Python0 0
schroederdewitt/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Language:Python0 0
schroederdewitt/TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
Language:Python0 0
schroederdewitt/torrvision.github.io
Language:JavaScript0 0
schroederdewitt/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
schroederdewitt/varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
Language:Python0 0