Amanda2024

Pinned Repositories

AI-Paper-Collector
Fully-automated scripts for collecting AI-related papers
Language:Python0 0 00
Algorithm_Interview_Notes-Chinese-backups
Language:Python0 0 00
attention-learn-to-route
Attention based model for learning to solve different routing problems
Language:Jupyter Notebook0 0 00
CARE-SMAC-MA_SAC
Multi-task Multi-agent Soft Actor Critic for SMAC
Language:Python12 1 01
Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
Language:Python2 0 00
EFA-DWM
Language:Python5 1 11
GCS_aamas337
The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》
Language:Python36 2 37
Papers-of-MARL
3 1 00
Papers-of-Offline-RL
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
1 0 00
PyTorch-Tutorial
Build your neural network easy and fast
Language:Jupyter Notebook0 0 00

Amanda2024's Repositories

Amanda2024/GCS_aamas337
The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》
Language:Python36 2 37
Amanda2024/CARE-SMAC-MA_SAC
Multi-task Multi-agent Soft Actor Critic for SMAC
Language:Python12 1 01
Amanda2024/EFA-DWM
Language:Python5 1 11
Amanda2024/Papers-of-MARL
3 1 00
Amanda2024/Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
Language:Python2 0 00
Amanda2024/Papers-of-Offline-RL
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
1 0 00
Amanda2024/AI-Paper-Collector
Fully-automated scripts for collecting AI-related papers
Language:Python0 0 00
Amanda2024/Algorithm_Interview_Notes-Chinese-backups
Language:Python0 0 00
Amanda2024/attention-learn-to-route
Attention based model for learning to solve different routing problems
Language:Jupyter Notebook0 0 00
Amanda2024/Ball-Run
Language:Python1 0
Amanda2024/BladeDancer957
0 0
Amanda2024/Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
Language:Python0 0
Amanda2024/CORRO
CORRO code
Language:Python0 0
Amanda2024/Deep-Reinforcement-Learning-Algorithms
This is a reconstruction of previous repository(rl-algorithms).
Language:Python0 0
Amanda2024/DGN
DGN Code
Language:Python0 0
Amanda2024/DuaLight
Amanda2024/Flowcomm
Amanda2024/football
Check out the new game server:
Language:Python0 0
Amanda2024/GCS
The implementation of GCS
Amanda2024/LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
Amanda2024/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Amanda2024/multi-agent-PPO-on-SMAC
Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.
Language:Python0 0
Amanda2024/NLPer-Interview
该仓库主要记录 NLP 算法工程师相关的面试题
Amanda2024/on-policy
Language:Python0 0
Amanda2024/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Amanda2024/Paper_Writing_Tips
2
Amanda2024/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
Amanda2024/SEQ-SCD
Amanda2024/shap
A game theoretic approach to explain the output of any machine learning model.
Language:Jupyter Notebook0 0
Amanda2024/WeTS
A benchmark for the task of translation suggestion