xz259's Stars
zju-vipa/Odyssey
Odyssey: Empowering Minecraft Agents with Open-World Skills
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
ctgk/PRML
PRML algorithms implemented in Python
gerdm/prml
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
ros2/ros2
The Robot Operating System, is a meta operating system for robots.
mantasu/cs231n
Shortest solutions for CS231n 2021-2024
cs231n/cs231n.github.io
Public facing notes page
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
MrSyee/pg-is-all-you-need
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Kaixhin/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
Farama-Foundation/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
mimoralea/gdrl
Grokking Deep Reinforcement Learning
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
kuleshov-group/aml-book
katerakelly/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
MarcoMeter/endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
ReactiveBayes/RxInfer.jl
Julia package for automated Bayesian inference on a factor graph with reactive message passing
zdhNarsil/Awesome-GFlowNets
A curated list of resources about generative flow networks (GFlowNets).
infer-actively/pymdp
A Python implementation of active inference for Markov Decision Processes
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
ftraeuble/experiments_discrete_key_value_bottleneck
Discrete Key-Value Bottleneck, ICML 2023
recursionpharma/gflownet
GFlowNet library specialized for graph & molecular data
GFNOrg/gflownet
Generative Flow Networks
marc-rigter/polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
cupy/cupy
NumPy & SciPy for GPU
enajx/HebbianMetaLearning
Meta-Learning through Hebbian Plasticity in Random Networks: https://arxiv.org/abs/2007.02686
marc-rigter/waker
Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.