carlosgmartin

carlosgmartin's Stars

google-research/google-research
Google Research
Language:Jupyter Notebook34.7k 753 1.3k8k
nushell/nushell
A new type of shell
Language:Rust33.4k 190 6k1.7k
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language:Python27.5k 275 8423.1k
getcursor/cursor
The AI Code Editor
26.8k 211 2.3k1.7k
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python6.1k 37 186684
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
Language:Python3.1k 46 297360
Thinklab-SJTU/awesome-ml4co
Awesome machine learning for combinatorial optimization papers.
Language:Python1.8k 41 2203
n2cholas/awesome-jax
JAX - A curated list of resources https://github.com/google/jax
1.7k 51 8134
google/uncertainty-baselines
High-quality implementations of standard and SOTA methods on a variety of tasks.
Language:Python1.5k 22 89205
joboccara/pipes
Pipelines for expressive code on collections in C++
Language:C++812 34 4480
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python793 13 2468
RobertTLange/gymnax
RL Environments in JAX 🌍
Language:Python680 10 5561
swansonk14/typed-argument-parser
Typed argument parser for Python
Language:Python536 7 10742
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language:Python478 10 4093
mpi4jax/mpi4jax
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python :zap:
Language:Python456 11 9230
konstmish/prodigy
The Prodigy optimizer and its variants for training neural networks.
Language:Python361 6 1923
mpSchrader/gym-sokoban
Sokoban environment for OpenAI Gym
Language:Python334 10 3679
imagry/aleph_star
Reinforcement learning with A* and a deep heuristic
Language:Jupyter Notebook288 19 534
mfinzi/equivariant-MLP
A library for programmatically generating equivariant layers through constraint solving
Language:Jupyter Notebook259 10 1522
gehring/fax
Language:Python80 9 199
jinwen-yang/cuPDLP.jl
Language:Julia57 7 212
bwfbowen/muax
A project that provides help for using DeepMind's mctx on gym-style environments.
Language:Python52 5 1110
ChezJrk/Teg
A differentiable programming language with an integration primitive that soundly handles interactions among the derivative, integral, and discontinuities.
Language:Python37 5 194
petosa/multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
Language:Python35 6 29
py-stockfish/stockfish
Integrates the Stockfish chess engine with Python (Official fork)
Language:Python33 2 589
arvoelke/nengolib
Nengo library of additional extensions
Language:Python29 4 1216
aletcher/stable-opponent-shaping
Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
Language:Jupyter Notebook21 3 02
lansiz/nash-finder
Find Nash equilibrium for all games
Language:Python19 1 03
levilelis/h-levin
Levin tree search guided by both a policy and a heuristic function
Language:Python16 5 67
lowrollr/mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
Language:Python15 1 10