charleshsc
Ph.D. student in @ SJTU. Did Research in @Thinklab-SJTU, @PJLab-ADG, @OpenPerceptionX
SJTUShanghai
charleshsc's Stars
charleshsc/HarmoDT
ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
charleshsc/CommFormer
ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective
charleshsc/QT
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
Shanghai-Digital-Brain-Laboratory/BDM-DB1
A large-scale multi-modal pre-trained model
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
MAGIC-AI4Med/KEP
[ECCV 2024 Oral] Knowledge-enhanced pretraining for computational pathology
senseek/piaoxingqiu
票星球自动抢票
joansj/hat
Overcoming catastrophic forgetting with hard attention to the task
Lucasc-99/PackNet-Continual-Learning
The PackNet Continual Learning Method in Pytorch
arunmallya/piggyback
Code for Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights
ShiArthur03/ShiArthur03
mmasana/FACIL
Framework for Analysis of Class-Incremental Learning with 12 state-of-the-art methods and 3 baselines.
awarelab/continual_world
NJU-RL/CuGRO
AGI-Labs/continual_rl
Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
stevenyangyj/CoTASP
Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting
mikelma/componet
Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)
median-research-group/LibMTL
A PyTorch Library for Multi-Task Learning
TToTMooN/paco-mtrl
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
takuseno/d3rlpy
An offline deep reinforcement learning library
tinnerhrhe/MTDiff
EstrellaXD/Auto_Bangumi
AutoBangumi - 全自动追番工具
sfujim/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
young-geng/CQL
Conservative Q Learning on top of SAC
TonghanWang/NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Zhendong-Wang/Diffusion-Policies-for-Offline-RL
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"