zcchenvy

An idiot in the fields of RL and multi-agent RL

Dalian Maritime University, Dalian

zcchenvy's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language: Python 166k 1.6k 2.6k44k
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
Language: Python 11.9k 123 3531.1k
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language: Python 7.8k 91 7371.1k
openai/consistency_models
Official repo for consistency models.
Language: Python 6.1k 59 51410
higgsfield/RL-Adventure
PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL
Language: Jupyter Notebook 3k 73 22588
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
2.7k 99 13221
google-deepmind/android_env
RL research on Android devices.
Language: Python 997 30 2770
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
Language: Python 621 7 5760
vwxyzjn/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
Language: Python 609 3 695
MolecularAI/aizynthfinder
A tool for retrosynthetic planning
Language: Python 570 29 106130
voidful/TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in Hugging Face's transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
Language: Python 537 11 2364
vikashplus/robohive
A unified framework for robot learning
Language: Python 489 11 4682
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
Language: Python 447 7 4452
boyu-ai/Hands-on-ML
https://hml.boyuai.com
Language: Jupyter Notebook 310 3 675
CleanDiffuserTeam/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Language: Jupyter Notebook 280 0 922
Dungyichao/Electric-Vehicle-Route-Planning-on-Google-Map-Reinforcement-Learning
Users can set a destination for any agent to navigate on Google Maps, and the agent learns the best route based on its current condition and the traffic. The reported result is 10% less energy consumption than the route provided by Google Maps.
Language: Python 232 13 764
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
208 8 39
dhruvramani/Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
Language: Python 167 4 422
chauncygu/Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
Language: Python 137 2 1023
martyput/MDP_book
84 8 16
ffelten/MASAC
Jax and Torch Multi-Agent SAC on PettingZoo API
Language: Python 57 1 26
FXDevailly/IG-RL
Inductive Graph Reinforcement Learning for Massive-Scale Traffic Signal Control
Language: Python 53 1 18
HzcIrving/DecisionTransformer_StepbyStep
Decision Transformer: A brand new Offline RL Pattern.
Language: Python 32 2 11
marina-haliem/Dynamic-RideSharing-Pooling-Simulator
A Simulator for Dynamic Ride-Sharing with Pooling: Joint Matching, Pricing, Route Planning, and Dispatching
Language: Python 21 2 78
paulorocosta/genetic-algorithm-GVRP
Implementation of the paper A Genetic Algorithm for a Green Vehicle Routing Problem
Language: Python 20 1 01
LucasAlegre/sfols
Code for the paper Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer - ICML 2022
Language: Python 8 2 01
serl-robot/serl
A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Language: Python 7 2 02
lich14/Traffic_Light_Transfer_Control
Language: Python 4 1 00
RL-DLMU/GNSD-Light
Language: Python 3 1 1
RL-DLMU/VF-MAPPO
Language: Python 1 1 0
