Amanda2024's Stars
songyingxin/NLPer-Interview
该仓库主要记录 NLP 算法工程师相关的面试题
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
pranz24/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
ChenglongChen/pytorch-DRL
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Lizhi-sjtu/MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Kaixhin/ACER
Actor-critic with experience replay
Kyushik/DRL
Repository for codes of 'Deep Reinforcement Learning'
oxwhirl/smacv2
imhuay/Algorithm_Interview_Notes-Chinese-backups
google-research/relay-policy-learning
BladeDancer957/CPFD
lich14/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
X-PLUG/mPLUG
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
shariqiqbal2810/REFIL
Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021
BladeDancer957/DualGATs
Code for ACL2023 paper 《DualGATs: Dual Graph Attention Networks for Emotion Recognition in Conversations》
Chacha-Chen/MPLight
UnrealTracking/ToM2C
The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .
BladeDancer957/INER_RDP
BladeDancer957/TSAM
The code for COLING2022 paper: 《TSAM: A Two-Stream Attention Model for Causal Emotion Entailment》
BladeDancer957/SPN-GA
atavakol/action-hypergraph-networks
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
lujiaming-12138/DuaLight
preciousxin/Qatten_Multiagent_RL
Implement of Qatten on SMAC (updating)
mike-gimelfarb/contextual-policy-reuse-deep-rl
Framework for Contextually Transferring Knowledge from Multiple Source Policies in Deep Reinforcement Learning
BladeDancer957/BladeDancer957
BladeDancer957/FISS
[CVPR2023] Federated Incremental Semantic Segmentation
BladeDancer957/controlvideo
Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"
BladeDancer957/WeTS
A benchmark for the task of translation suggestion