StepNeverStop
Ph.D. candidate at Lamda 5 group in Nanjing University. My research interest is deep reinforcement learning.
Nanjing UniversityNan Jing
Pinned Repositories
acme
A library of reinforcement learning components and agents
ConnectSix
Connect six environment training for AI Bot
DeepLearningMethods
Crawl deep learning methods from https://paperswithcode.com/methods
RL-TF1
Reinforcement learning algorithms implemented based on tensorflow 1.x
RLs
Reinforcement Learning Algorithms Based on PyTorch
RLwithUnity
Reinforcement Leanring Algorithms Trained with Unity
Staged-Experience-Mechanism
Code of `Staged Experience Mechanism (SEM)`
StepNeverStop.github.io
TF2-RL
Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO]
UnityEnvs
Reinforcement Learning Environments with ML-Agents.
StepNeverStop's Repositories
StepNeverStop/RLs
Reinforcement Learning Algorithms Based on PyTorch
StepNeverStop/DeepLearningMethods
Crawl deep learning methods from https://paperswithcode.com/methods
StepNeverStop/UnityEnvs
Reinforcement Learning Environments with ML-Agents.
StepNeverStop/RL-TF1
Reinforcement learning algorithms implemented based on tensorflow 1.x
StepNeverStop/StepNeverStop.github.io
StepNeverStop/Staged-Experience-Mechanism
Code of `Staged Experience Mechanism (SEM)`
StepNeverStop/TF2-RL
Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO]
StepNeverStop/acme
A library of reinforcement learning components and agents
StepNeverStop/Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
StepNeverStop/Deep-Reinforcement-Learning-Algorithms
25 projects in the framework of Deep Reinforcement Learning algorithms: DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
StepNeverStop/StepNeverStop
StepNeverStop/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
StepNeverStop/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
StepNeverStop/d3rlpy
An offline deep reinforcement learning library
StepNeverStop/d4rl-pybullet
Datasets for Data-Driven Deep Reinforcement Learning with Pybullet environments
StepNeverStop/DeepRL
Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone
StepNeverStop/garage
A toolkit for reproducible reinforcement learning research
StepNeverStop/gym-collision-avoidance
StepNeverStop/highway-env
A minimalist environment for decision-making in autonomous driving
StepNeverStop/JARVIS
JARVIS, a system to connect LLMs with ML community
StepNeverStop/LAMDA-Beamer-Template
A beamer template for LAMDA lab at NJU
StepNeverStop/leeml-notes
李宏毅《机器学习》笔记,在线阅读地址:https://datawhalechina.github.io/leeml-notes
StepNeverStop/machin
Reinforcement learning library designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
StepNeverStop/NJU-health-report
用于在 GitHub Actoin 上部署南京大学每日健康填报自动打卡脚本
StepNeverStop/off-dynamics-rl
StepNeverStop/planet
Learning Latent Dynamics for Planning from Pixels
StepNeverStop/SHU-selfreport
上海大学每日一报挂机自动打卡
StepNeverStop/shuthesis
LaTeX Thesis Template for Shanghai University
StepNeverStop/unstable_baselines
Re-implementations of SOTA RL algorithms.
StepNeverStop/xingtian
xingtian is a componentized library for the development and verification of reinforcement learning algorithms