rezunli96

Google DeepMind, New York

rezunli96's Stars

openai/openai-cookbook
Examples and guides for using the OpenAI API
Language: MDX 62.4k 913 52610.1k
electronicarts/CnC_Remastered_Collection
Command & Conquer: Remastered Collection
Language: C++ 20.9k 540 05.3k
eriklindernoren/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
Language: Python 16.9k 221 1584.1k
google-deepmind/sonnet
TensorFlow-based neural network library
Language: Python 9.8k 421 1931.3k
bayesian-optimization/BayesianOptimization
A Python implementation of global optimization with gaussian processes.
Language: Python 8.1k 131 3761.6k
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python 6.6k 39 193724
ReaVNaiL/New-Grad-2024
👋 Hey there new grad🎉! We've put together a collection of full-time job openings for SWE, Quant, PM and tech roles in 2024! 🚀
Language: Python 6.4k 2.1k 0569
lightvector/KataGo
GTP engine and self-play learning in Go
Language: C++ 3.8k 79 840587
google-deepmind/acme
A library of reinforcement learning components and agents
Language: Python 3.6k 82 268457
google-deepmind/dm-haiku
JAX-based neural network library
Language: Python 3k 35 249242
google-deepmind/mctx
Monte Carlo tree search in JAX
Language: Python 2.4k 26 52198
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Language: Python 2k 29 131396
n2cholas/awesome-jax
JAX - A curated list of resources https://github.com/google/jax
1.7k 51 8139
tensorly/tensorly
TensorLy: Tensor Learning in Python.
Language: Python 1.6k 43 275294
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Language: Python 1.4k 19 23140
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language: Python 839 13 2469
google-deepmind/concordia
A library for generative social simulation
Language: Python 815 23 43177
instadeepai/Mava
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Language: Python 786 15 458102
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language: Python 539 12 44107
sotetsuk/pgx
♟️ Vectorized RL game environments in JAX
Language: Python 455 8 24633
google-deepmind/launchpad
Language: Python 319 17 4040
facebookresearch/nocturne
A data-driven, fast driving simulator for multi-agent coordination under partial observability.
Language: Python 273 13 4630
IC3Net/IC3Net
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
Language: Python 217 4 1151
facebookresearch/minimax
Efficient baselines for autocurricula in JAX.
Language: Python 186 6 517
causalincentives/pycid
Library for graphical models of decision making, based on pgmpy and networkx
Language: Jupyter Notebook 105 6 6115
duchi-lab/certifiable-distributional-robustness
Certifying Some Distributional Robustness with Principled Adversarial Training (https://arxiv.org/abs/1710.10571)
Language: Python 45 5 012
asappresearch/emergent-comms-negotiation
Reproduce ICLR2018 submission "Emergent Communication through Negotiation"
Language: Python 17 6 28
mnoukhov/emergent-compete
Code for Emergent Communication under Competition (AAMAS 2021)
Language: Jupyter Notebook 10 2 101
yasserfarouk/scml
ANAC Supply Chain Management League Development Environment
Language: Jupyter Notebook 10 3 247
iassael/learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Language: Python 4 2 01
