Amanda2024

Amanda2024's Stars

songyingxin/NLPer-Interview
该仓库主要记录 NLP 算法工程师相关的面试题
2.6k 55 6505
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
Language:Python833 12 62128
pranz24/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:Python803 9 37179
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python604 16 40118
ChenglongChen/pytorch-DRL
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Language:Python524 12 7106
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language:Python456 6 2063
Lizhi-sjtu/MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Language:Python417 2 2354
Kaixhin/ACER
Actor-critic with experience replay
Language:Python251 13 1346
Kyushik/DRL
Repository for codes of 'Deep Reinforcement Learning'
Language:Python214 10 043
oxwhirl/smacv2
Language:Python202 5 3331
imhuay/Algorithm_Interview_Notes-Chinese-backups
Language:Python155 2 025
google-research/relay-policy-learning
Language:Python103 8 628
BladeDancer957/CPFD
Language:Python87 7 116
lich14/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
Language:Python83 1 1120
X-PLUG/mPLUG
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
Language:Python81 2 106
shariqiqbal2810/REFIL
Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021
Language:Python61 2 313
BladeDancer957/DualGATs
Code for ACL2023 paper 《DualGATs: Dual Graph Attention Networks for Emotion Recognition in Conversations》
Language:Python58 4 912
Chacha-Chen/MPLight
Language:Python56 4 1819
UnrealTracking/ToM2C
The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .
Language:Python55 2 110
BladeDancer957/INER_RDP
Language:Python42 3 08
BladeDancer957/TSAM
The code for COLING2022 paper: 《TSAM: A Two-Stream Attention Model for Causal Emotion Entailment》
Language:Python36 2 31
BladeDancer957/SPN-GA
Language:Python32 1 00
atavakol/action-hypergraph-networks
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
Language:Python21 1 16
lujiaming-12138/DuaLight
Language:Python10 1 31
preciousxin/Qatten_Multiagent_RL
Implement of Qatten on SMAC (updating)
Language:Python6 1 03
mike-gimelfarb/contextual-policy-reuse-deep-rl
Framework for Contextually Transferring Knowledge from Multiple Source Policies in Deep Reinforcement Learning
3 2 00
BladeDancer957/BladeDancer957
21
BladeDancer957/FISS
[CVPR2023] Federated Incremental Semantic Segmentation
Language:Python2 0 00
BladeDancer957/controlvideo
Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"
Language:Python10
BladeDancer957/WeTS
A benchmark for the task of translation suggestion
Language:Mask1 0 00