pseudo-rnd-thoughts
PhD Student at the University of Southampton exploring Explainable Reinforcement Learning
pseudo-rnd-thoughts's Stars
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
arcee-ai/mergekit
Tools for merging pretrained large language models.
chrxh/alien
ALIEN is a CUDA-powered artificial life simulation program.
facebookresearch/sapiens
High-resolution models for human tasks.
danijar/dreamerv3
Mastering Diverse Domains through World Models
younader/Vesuvius-Grandprize-Winner
oTree-org/oTree
Python framework for multiplayer decision games, behavioral experiments, and surveys
salesforce/warp-drive
Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
ndif-team/nnsight
The nnsight package enables interpreting and manipulating the internals of deep learned models.
nicklashansen/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
google-deepmind/treescope
An interactive HTML pretty-printer for machine learning research in IPython notebooks.
proroklab/VectorizedMultiAgentSimulator
VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.
EdanToledo/Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
zuoxingdong/mazelab
A customizable framework to create maze and gridworld environments
Emerge-Lab/gpudrive
GPU-acceleration of Nocturne via Madrona
imbue-ai/carbs
Cost aware hyperparameter tuning algorithm
mttga/purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
epignatelli/navix
Accelerated minigrid environments with JAX
Farama-Foundation/momaland
Benchmarks for Multi-Objective Multi-Agent Decision Making
adityab/CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
k4ntz/OC_Atari
Object Centric Atari games
BricksRL/bricksrl
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
strakam/generals-bots
Develop your agent for generals.io!
nikaashpuri/sarfa-saliency
smearle/autoverse
Generative cellular automaton-like learning environments for RL.
brownirl/lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
dadecampo/aquatic_navigation_envs
Aquatic navigation environments for Gym
adaptive-intelligent-robotics/QDAC
Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" (ICML 2024).
k4ntz/HackAtari
smearle/pcgrl-jax