micolxs

micolxs's Stars

binary-husky/gpt_academic
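Provides a practical interaction interface for LLMs such as GPT/GLM, specially optimized for paper reading, polishing, and writing. Modular design; supports custom shortcut buttons and function plugins; project analysis and self-translation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot (Wenxin Yiyan), llama2, rwkv, claude2, moss, and others.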
Language:Python67.7k 279 1.7k8.3k
google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language:Jupyter Notebook10.6k 422 1721.4k
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Language:Python10.6k 232 2922.3k
datawhalechina/easy-rl
A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at: https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook10.4k 79 1491.9k
DEAP/deap
Distributed Evolutionary Algorithms in Python
Language:Python6k 190 5211.1k
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language:Python4.2k 63 952724
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Language:Python3.2k 22 216388
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Language:Python1.9k 9 9236
SebLague/Slime-Simulation
Language:C#1.4k 24 10251
rail-berkeley/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language:Python1.3k 37 103245
agi-brain/xuance
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Language:Python746 15 88117
xahidbuffon/Awesome_Underwater_Datasets
Pointers to large-scale underwater datasets and relevant resources.
581 14 5127
FilippoAiraldi/mpc-reinforcement-learning
Reinforcement Learning with Model Predictive Control
Language:Python366 6 847
baopng/NSGA-II
Implementation of NSGA-II algorithm in form of a python library.
Language:Python204 5 850
Valdecy/pyMultiobjective
A python library for the following Multiobjective Optimization Algorithms or Many Objectives Optimization Algorithms: C-NSGA II; CTAEA; GrEA; HypE; IBEA-FC; IBEA-HV; MOEA/D; NAEMO; NSGA II; NSGA III; OMOPSO; PAES; RVEA; SMPSO; SMS-EMOA; SPEA2; U-NSGA III
Language:Python169 5 126
jlubars/RL-MPC-LaneMerging
Combining Reinforcement Learning with Model Predictive Control for On-Ramp Merging
Language:Python126 1 1022
lmarti/nsgaiii
An implementation of NSGA-III in Python.
Language:Jupyter Notebook113 10 152
FilippoAiraldi/learning-safety-in-mpc-based-rl
Safety-aware MPC-based RL framework
Language:Python51 3 08
xinliu20/MEC
Language:Python42 12 62
baggepinnen/Robotlib.jl
Robotics library written in the Julia programming language
Language:Julia40 6 109
MAS-anony/ASN
Language:Python32 2 17
UtkarshMishra04/DMD-MPC-RL
This repository contains the code for our paper on Dynamic Mirror Descent based MPC for Model-Free RL
Language:Python23 3 04
xinliu20/GraphCSPN_ECCV2022
Language:Python21 2 71
baggepinnen/LTVModels.jl
Tools to estimate Linear Time-Varying models in Julia
Language:Julia19 4 23
yeshenpy/PMIC
Original PyTorch implementation of PMIC from PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
Language:Python18 3 32
mrjun123/DPETS
Code for the paper "Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling"
Language:Python131
baggepinnen/LPVSpectral.jl
Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.
Language:Julia12 4 66
AdrienLin1/MACDPP
Code for Paper "Effective Multi-agent Reinforcement Learning Control with Relative Entropy Regularization".
Language:Python7 1 01
zhaoshitian/rl-tutorials
basic algorithms of reinforcement learning
Language:Jupyter Notebook7 0 01
itstyren/reputationRL-coop
Code for reproducing the experimental results presented in the paper "Reputation-based Interaction Promotes Cooperation with Reinforcement Learning."
Language:Jupyter Notebook2 1 00