Pinned Repositories
AI-Paper-Collector
Fully-automated scripts for collecting AI-related papers
Algorithm_Interview_Notes-Chinese-backups
attention-learn-to-route
Attention-based model for learning to solve different routing problems
CARE-SMAC-MA_SAC
Multi-task Multi-agent Soft Actor Critic for SMAC
Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
EFA-DWM
GCS_aamas337
The code for the AAMAS 2022 paper "GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning"
Papers-of-MARL
Papers-of-Offline-RL
Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling, as well as conventional offline RL)
PyTorch-Tutorial
Build your neural network easily and quickly
Amanda2024's Repositories
Amanda2024/GCS_aamas337
The code for the AAMAS 2022 paper "GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning"
Amanda2024/CARE-SMAC-MA_SAC
Multi-task Multi-agent Soft Actor Critic for SMAC
Amanda2024/EFA-DWM
Amanda2024/Papers-of-MARL
Amanda2024/Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
Amanda2024/Papers-of-Offline-RL
Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling, as well as conventional offline RL)
Amanda2024/AI-Paper-Collector
Fully-automated scripts for collecting AI-related papers
Amanda2024/Algorithm_Interview_Notes-Chinese-backups
Amanda2024/attention-learn-to-route
Attention-based model for learning to solve different routing problems
Amanda2024/Ball-Run
Amanda2024/BladeDancer957
Amanda2024/Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
Amanda2024/CORRO
CORRO code
Amanda2024/Deep-Reinforcement-Learning-Algorithms
This is a reconstruction of the previous repository (rl-algorithms).
Amanda2024/DGN
DGN Code
Amanda2024/DuaLight
Amanda2024/Flowcomm
Amanda2024/football
Check out the new game server:
Amanda2024/GCS
The implementation of GCS
Amanda2024/LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
Amanda2024/minimalRL
Implementations of basic RL algorithms in minimal lines of code! (PyTorch-based)
Amanda2024/multi-agent-PPO-on-SMAC
Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.
Amanda2024/NLPer-Interview
This repository mainly collects interview questions for NLP algorithm engineers
Amanda2024/on-policy
Amanda2024/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Amanda2024/Paper_Writing_Tips
Amanda2024/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
Amanda2024/SEQ-SCD
Amanda2024/shap
A game theoretic approach to explain the output of any machine learning model.
Amanda2024/WeTS
A benchmark for the task of translation suggestion