xz259

xz259's Stars

zju-vipa/Odyssey
Odyssey: Empowering Minecraft Agents with Open-World Skills
Language:JavaScript27817
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Language:Python11.6k1k
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
Language:C++32.1k3.7k
ctgk/PRML
PRML algorithms implemented in Python
Language:Jupyter Notebook11.5k3.3k
gerdm/prml
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
Language:Jupyter Notebook2.2k498
ros2/ros2
The Robot Operating System, is a meta operating system for robots.
3.7k685
mantasu/cs231n
Shortest solutions for CS231n 2021-2024
Language:Jupyter Notebook27463
cs231n/cs231n.github.io
Public facing notes page
Language:Jupyter Notebook10.3k4.1k
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python6k675
MrSyee/pg-is-all-you-need
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
Language:Jupyter Notebook875120
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language:Python2.2k524
Kaixhin/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
Language:Python1.6k286
Farama-Foundation/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
Language:Python59393
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Language:Python7.7k866
mimoralea/gdrl
Grokking Deep Reinforcement Learning
Language:Jupyter Notebook832237
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python9.4k1.7k
kuleshov-group/aml-book
Language:Jupyter Notebook5624
katerakelly/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Language:Python477125
MarcoMeter/endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
Language:Python912
ReactiveBayes/RxInfer.jl
Julia package for automated Bayesian inference on a factor graph with reactive message passing
Language:Jupyter Notebook28525
zdhNarsil/Awesome-GFlowNets
A curated list of resources about generative flow networks (GFlowNets).
42827
infer-actively/pymdp
A Python implementation of active inference for Markov Decision Processes
Language:Python484100
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python2.1k614
ftraeuble/experiments_discrete_key_value_bottleneck
Discrete Key-Value Bottleneck, ICML 2023
Language:Jupyter Notebook22
recursionpharma/gflownet
GFlowNet library specialized for graph & molecular data
Language:Python22143
GFNOrg/gflownet
Generative Flow Networks
Language:Python61377
marc-rigter/polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
Language:Python612
cupy/cupy
NumPy & SciPy for GPU
Language:Python9.6k862
enajx/HebbianMetaLearning
Meta-Learning through Hebbian Plasticity in Random Networks: https://arxiv.org/abs/2007.02686
Language:Python13022
marc-rigter/waker
Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.
Language:Python272