sandguine

👩‍💻🧘🤸🏳️‍🌈

University of California, BerkeleySan Francisco, Bay Area

Pinned Repositories

AsymmPlay
Language:Jupyter Notebook0 3 00
MarvinGPT
Language:Python3 2 02
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language:Python1 0 00
cicero
running cicero on google colab
Language:Jupyter Notebook1 1 00
concordia
A library for generative social simulation
Language:Python1 0 00
diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
Language:Python0 0 00
Melting-Pot-Contest-2023
Language:Python1 0 00
meltingpot
A suite of test scenarios for multi-agent reinforcement learning.
Language:Python1 0 00
mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"
Language:Jupyter Notebook1 1 00
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language:C++1 1 00

sandguine's Repositories

sandguine/concordia
A library for generative social simulation
Language:Python1 0 00
sandguine/Melting-Pot-Contest-2023
Language:Python1 0 00
sandguine/meltingpot
A suite of test scenarios for multi-agent reinforcement learning.
Language:Python1 0 00
sandguine/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook0 0
sandguine/distributional-sr
Official implementation of the δ-model presented in the paper "A Distributional Analogue to the Successor Representation".
sandguine/effective-horizon
Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
sandguine/fast-marl
FAST iteration of MARL research ideas: A starting point for Multi-Agent Reinforcement Learning
Language:Python0 0
sandguine/hanabi.github.io
A list of Hanabi strategies
sandguine/hidden-context
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
Language:Python0 0
sandguine/icvf_release
Public code for "Reinforcement Learning from Passive Data via Latent Intentions"
sandguine/JaxMARL-minimal-information
Multi-Agent Reinforcement Learning with JAX
Language:Python
sandguine/lab2d
A customisable 2D platform for agent-based AI research
Language:C++0 0
sandguine/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python0 0
sandguine/Mava
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
sandguine/maxtext
A simple, performant and scalable Jax LLM!
sandguine/Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
Language:Python0 0
sandguine/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Language:Jupyter Notebook0 0
sandguine/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python0 0
sandguine/Neural-Network-Zero-to-Hero
Writing keys libraries and core architectures from scratch. Following the tutorials of Neural Network Zero to Hero class from Andrej Karphathy.
1 0
sandguine/nn-zero-to-hero
Neural Networks: Zero to Hero
Language:Jupyter Notebook0 0
sandguine/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
sandguine/paper-reviewer-matcher
Linear programming solver for paper-reviewer matching and mind-matching
Language:Python
sandguine/pax
Scalable Opponent Shaping Experiments in JAX
Language:Python0 0
sandguine/purejaxrl
Really Fast End-to-End Jax RL Implementations
sandguine/pycid
Library for graphical models of decision making, based on pgmpy and networkx
sandguine/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python0 0
sandguine/redpoint_hacks
Language:Python0 0
sandguine/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
sandguine/SAELens
Training Sparse Autoencoders on Language Models
Language:HTML0 0
sandguine/Voyager-Contracts
CAIF
Language:JavaScript0 0