zcchenvy's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
openai/consistency_models
Official repo for consistency models.
higgsfield/RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
google-deepmind/android_env
RL research on Android devices.
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
vwxyzjn/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
MolecularAI/aizynthfinder
A tool for retrosynthetic planning
voidful/TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
vikashplus/robohive
A unified framework for robot learning
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
boyu-ai/Hands-on-ML
https://hml.boyuai.com
CleanDiffuserTeam/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Dungyichao/Electric-Vehicle-Route-Planning-on-Google-Map-Reinforcement-Learning
User can set up destination for any agent to navigate on Google Map and learn the best route for the agent based on its current condition and the traffic. Our result is 10% less energy consumption than the route provided by Google map
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
dhruvramani/Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
chauncygu/Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
martyput/MDP_book
ffelten/MASAC
Jax and Torch Multi-Agent SAC on PettingZoo API
FXDevailly/IG-RL
Inductive Graph Reinforcement Learning for Massive-Scale Traffic Signal Control
HzcIrving/DecisionTransformer_StepbyStep
Decision Transformer: A brand new Offline RL Pattern.
marina-haliem/Dynamic-RideSharing-Pooling-Simulator
A Simulator for Dynamic Ride-Sharing with Pooling: Joint Matching,Pricing, Route Planning, and Dispatching
paulorocosta/genetic-algorithm-GVRP
Implementation of the paper A Genetic Algorithm for a Green Vehicle Routing Problem
LucasAlegre/sfols
Code for the paper Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer - ICML 2022
serl-robot/serl
A Software Suite for Sample-Efficient Robotic Reinforcement Learning
lich14/Traffic_Light_Transfer_Control
RL-DLMU/GNSD-Light
RL-DLMU/VF-MAPPO