carlosgmartin's Stars
google-research/google-research
Google Research
nushell/nushell
A new type of shell
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
getcursor/cursor
The AI Code Editor
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
Thinklab-SJTU/awesome-ml4co
Awesome machine learning for combinatorial optimization papers.
n2cholas/awesome-jax
JAX - A curated list of resources https://github.com/google/jax
google/uncertainty-baselines
High-quality implementations of standard and SOTA methods on a variety of tasks.
joboccara/pipes
Pipelines for expressive code on collections in C++
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
RobertTLange/gymnax
RL Environments in JAX 🌍
swansonk14/typed-argument-parser
Typed argument parser for Python
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
mpi4jax/mpi4jax
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python :zap:
konstmish/prodigy
The Prodigy optimizer and its variants for training neural networks.
mpSchrader/gym-sokoban
Sokoban environment for OpenAI Gym
imagry/aleph_star
Reinforcement learning with A* and a deep heuristic
mfinzi/equivariant-MLP
A library for programmatically generating equivariant layers through constraint solving
gehring/fax
jinwen-yang/cuPDLP.jl
bwfbowen/muax
A project that provides help for using DeepMind's mctx on gym-style environments.
ChezJrk/Teg
A differentiable programming language with an integration primitive that soundly handles interactions among the derivative, integral, and discontinuities.
petosa/multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
py-stockfish/stockfish
Integrates the Stockfish chess engine with Python (Official fork)
arvoelke/nengolib
Nengo library of additional extensions
aletcher/stable-opponent-shaping
Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
lansiz/nash-finder
Find Nash equilibrium for all games
levilelis/h-levin
Levin tree search guided by both a policy and a heuristic function
lowrollr/mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree