rezunli96's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
electronicarts/CnC_Remastered_Collection
Command & Conquer: Remastered Collection
eriklindernoren/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
google-deepmind/sonnet
TensorFlow-based neural network library
bayesian-optimization/BayesianOptimization
A Python implementation of global optimization with Gaussian processes.
vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
ReaVNaiL/New-Grad-2024
👋 Hey there new grad🎉! We've put together a collection of full-time job openings for SWE, Quant, PM and tech roles in 2024! 🚀
lightvector/KataGo
GTP engine and self-play learning in Go
google-deepmind/acme
A library of reinforcement learning components and agents
google-deepmind/dm-haiku
JAX-based neural network library
google-deepmind/mctx
Monte Carlo tree search in JAX
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
n2cholas/awesome-jax
JAX - A curated list of resources https://github.com/google/jax
tensorly/tensorly
TensorLy: Tensor Learning in Python.
Farama-Foundation/chatarena
ChatArena (or Chat Arena) provides Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
google-deepmind/concordia
A library for generative social simulation
instadeepai/Mava
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
sotetsuk/pgx
♟️ Vectorized RL game environments in JAX
google-deepmind/launchpad
facebookresearch/nocturne
A data-driven, fast driving simulator for multi-agent coordination under partial observability.
IC3Net/IC3Net
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
facebookresearch/minimax
Efficient baselines for autocurricula in JAX.
causalincentives/pycid
Library for graphical models of decision making, based on pgmpy and networkx
duchi-lab/certifiable-distributional-robustness
Certifying Some Distributional Robustness with Principled Adversarial Training (https://arxiv.org/abs/1710.10571)
asappresearch/emergent-comms-negotiation
Reproduce ICLR2018 submission "Emergent Communication through Negotiation"
mnoukhov/emergent-compete
Code for Emergent Communication under Competition (AAMAS 2021)
yasserfarouk/scml
ANAC Supply Chain Management League Development Environment
iassael/learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch