jifei-deng

Aalto UniversityEspoo, Finland

jifei-deng's Stars

LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
4k720
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook14.6k1.3k
pnnl/neuromancer
Pytorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.
Language:Python870114
liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
Language:Python654
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python64k7.9k
QingyuZhao/VAE-for-Regression
A toy example of VAE-regression network
Language:Jupyter Notebook7021
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python1.8k380
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language:Python2.2k290
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Language:Python10k2.2k
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
Language:Python1.3k278
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
Language:Python466135
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2.3k785
grpc/grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
Language:C++41.6k10.5k
facebookresearch/playtorch
PlayTorch is a framework for rapidly creating mobile AI experiences.
Language:MDX829101
towardsai/tutorials
AI-related tutorials. Access any of them for free → https://towardsai.net/editorial
Language:Jupyter Notebook983365
kschweig/OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Language:Jupyter Notebook246
Stable-Baselines-Team/stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language:Python28360
kchua/handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Language:Python42498
hemerson1/offline-glucose
The code release for "Offline Reinforcement Learning for Safer Blood Glucose Control in People with Type 1 Diabetes".
Language:Python159
young-geng/CQL
Conservative Q Learning on top of SAC
Language:Python11824
aviralkumar2907/CQL
Code for conservative Q-learning
Language:Python39369
instadeepai/Mava
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Language:Python69683
forgi86/sysid-neural-continuous
Continuous-time system identification with neural networks
Language:Python249
haozhg/oml
AI4Science: Efficient data-driven Online Model Learning (OML) / system identification and control
Language:Python285
dgedon/DeepSSM_SysID
Official PyTorch implementation of "Deep State Space Models for Nonlinear System Identification", 2020.
Language:Python8824
CPCLAB-UNIPI/SIPPY
Systems Identification Package for PYthon
Language:Python27092
wilsonrljr/sysidentpy
A Python Package For System Identification Using NARMAX Models
Language:Python38077
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Language:Python1.7k433
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python33.1k5.6k
ShangtongZhang/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Language:Python3.2k677

jifei-deng

jifei-deng's Stars

LantaoYu/MARL-Papers

KindXiaoming/pykan

pnnl/neuromancer

liuzuxin/DSRL

binary-husky/gpt_academic

QingyuZhao/VAE-for-Regression

oxwhirl/pymarl

pytorch/rl

openai/spinningup

Farama-Foundation/D4RL

uoe-agents/epymarl

openai/multiagent-particle-envs

grpc/grpc

facebookresearch/playtorch

towardsai/tutorials

kschweig/OfflineRL

Stable-Baselines-Team/stable-baselines

kchua/handful-of-trials

hemerson1/offline-glucose

young-geng/CQL

aviralkumar2907/CQL

instadeepai/Mava

forgi86/sysid-neural-continuous

haozhg/oml

dgedon/DeepSSM_SysID

CPCLAB-UNIPI/SIPPY

wilsonrljr/sysidentpy

sfujim/TD3

ray-project/ray

ShangtongZhang/DeepRL