jifei-deng's Stars
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
KindXiaoming/pykan
Kolmogorov Arnold Networks
pnnl/neuromancer
PyTorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.
liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
binary-husky/gpt_academic
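A practical interactive interface for GPT, GLM, and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.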
QingyuZhao/VAE-for-Regression
A toy example of VAE-regression network
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
pytorch/rl
A modular, primitive-first, Python-first PyTorch library for Reinforcement Learning.
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
grpc/grpc
The C-based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
facebookresearch/playtorch
PlayTorch is a framework for rapidly creating mobile AI experiences.
towardsai/tutorials
AI-related tutorials. Access any of them for free → https://towardsai.net/editorial
kschweig/OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Stable-Baselines-Team/stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
kchua/handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
hemerson1/offline-glucose
The code release for "Offline Reinforcement Learning for Safer Blood Glucose Control in People with Type 1 Diabetes".
young-geng/CQL
Conservative Q-Learning on top of SAC
aviralkumar2907/CQL
Code for conservative Q-learning
instadeepai/Mava
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
forgi86/sysid-neural-continuous
Continuous-time system identification with neural networks
haozhg/oml
AI4Science: Efficient data-driven Online Model Learning (OML) / system identification and control
dgedon/DeepSSM_SysID
Official PyTorch implementation of "Deep State Space Models for Nonlinear System Identification", 2020.
CPCLAB-UNIPI/SIPPY
Systems Identification Package for PYthon
wilsonrljr/sysidentpy
A Python Package For System Identification Using NARMAX Models
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
ShangtongZhang/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch