charleshsc

Ph.D. student in @ SJTU. Did Research in @Thinklab-SJTU, @PJLab-ADG, @OpenPerceptionX

SJTUShanghai

charleshsc's Stars

charleshsc/HarmoDT
ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
Language:Python81
charleshsc/CommFormer
ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective
Language:Python12
charleshsc/QT
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
Language:Python142
Shanghai-Digital-Brain-Laboratory/BDM-DB1
A large-scale multi-modal pre-trained model
Language:Python1299
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python1.1k124
MAGIC-AI4Med/KEP
[ECCV 2024 Oral] Knowledge-enhanced pretraining for computational pathology
Language:Python17
senseek/piaoxingqiu
票星球自动抢票
14961
joansj/hat
Overcoming catastrophic forgetting with hard attention to the task
Language:Python20252
Lucasc-99/PackNet-Continual-Learning
The PackNet Continual Learning Method in Pytorch
Language:Python143
arunmallya/piggyback
Code for Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights
Language:Python18027
ShiArthur03/ShiArthur03
Language:MATLAB10.4k1.9k
mmasana/FACIL
Framework for Analysis of Class-Incremental Learning with 12 state-of-the-art methods and 3 baselines.
Language:Python52499
awarelab/continual_world
Language:Python8216
NJU-RL/CuGRO
Language:Python7
AGI-Labs/continual_rl
Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.
Language:Python10211
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python5.4k616
stevenyangyj/CoTASP
Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting
Language:Python141
mikelma/componet
Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)
Language:Python91
median-research-group/LibMTL
A PyTorch Library for Multi-Task Learning
Language:Python2k182
TToTMooN/paco-mtrl
Language:Python243
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python21.8k2.1k
takuseno/d3rlpy
An offline deep reinforcement learning library
Language:Python1.3k232
tinnerhrhe/MTDiff
Language:Python484
EstrellaXD/Auto_Bangumi
AutoBangumi - 全自动追番工具
Language:Python6.7k348
sfujim/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
Language:Python31848
young-geng/CQL
Conservative Q Learning on top of SAC
Language:Python11824
TonghanWang/NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
Language:Python8216
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python1.8k382
Zhendong-Wang/Diffusion-Policies-for-Offline-RL
Language:Python25436
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2.3k785

charleshsc

charleshsc's Stars

charleshsc/HarmoDT

charleshsc/CommFormer

charleshsc/QT

Shanghai-Digital-Brain-Laboratory/BDM-DB1

tinkoff-ai/CORL

MAGIC-AI4Med/KEP

senseek/piaoxingqiu

joansj/hat

Lucasc-99/PackNet-Continual-Learning

arunmallya/piggyback

ShiArthur03/ShiArthur03

mmasana/FACIL

awarelab/continual_world

NJU-RL/CuGRO

AGI-Labs/continual_rl

vwxyzjn/cleanrl

stevenyangyj/CoTASP

mikelma/componet

median-research-group/LibMTL

TToTMooN/paco-mtrl

hpcaitech/Open-Sora

takuseno/d3rlpy

tinnerhrhe/MTDiff

EstrellaXD/Auto_Bangumi

sfujim/TD3_BC

young-geng/CQL

TonghanWang/NDQ

oxwhirl/pymarl

Zhendong-Wang/Diffusion-Policies-for-Offline-RL

openai/multiagent-particle-envs