hsvgbkhgbv
I am a Senior Research Associate focusing on multi-agent reinforcement learning and ad hoc teamwork.
University of Bristol / University of ManchesterBristol
Pinned Repositories
MAPDN
This repository is for an open-source environment for multi-agent active voltage control on power distribution networks (MAPDN).
CIAO
This repository includes the implementation of the ICML 2024 paper titled "Open Ad Hoc Teamwork with Cooperative Game Theory."
Matlab-Implement-HMM
This project implements HMM trained by EM and decoded by Viterbi.
Mean-field-Fictitious-Play-in-Potential-Games
Mean-field Fictitious Play in Potential Games
PyPSA
PyPSA: Python for Power System Analysis
shapley-q-learning
This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.
Snore-Sound-Classification-by-Deep-Learning
This is the implementation for the paper: A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification.
SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
HDNO
This is the source code for HDNO: a hierarchical model for task-oriented dialogue system.
hsvgbkhgbv's Repositories
hsvgbkhgbv/SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
hsvgbkhgbv/shapley-q-learning
This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.
hsvgbkhgbv/Snore-Sound-Classification-by-Deep-Learning
This is the implementation for the paper: A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification.
hsvgbkhgbv/CIAO
This repository includes the implementation of the ICML 2024 paper titled "Open Ad Hoc Teamwork with Cooperative Game Theory."
hsvgbkhgbv/PyPSA
PyPSA: Python for Power System Analysis
hsvgbkhgbv/naht-dev
Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., Neurips 2024).
hsvgbkhgbv/comix
hsvgbkhgbv/ConvLab
DSTC8 Track 1 Task 1 End-to-End Multi-Domain Dialog Challenge Result:
hsvgbkhgbv/dcg
hsvgbkhgbv/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
hsvgbkhgbv/Fair-MARL
Cooperation and Fairness in Multi-Agent Reinforcement Learning
hsvgbkhgbv/graph-marl
Multi-Agent Reinforcement Learning in Graphs
hsvgbkhgbv/hsvgbkhgbv.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
hsvgbkhgbv/InforMARL-dev
Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
hsvgbkhgbv/JaxMARL
Multi-Agent Reinforcement Learning with JAX
hsvgbkhgbv/larl_trial
hsvgbkhgbv/lb-foraging
Level-Based Foraging (LBF): A multi-agent reinforcement learning environment
hsvgbkhgbv/MADRaS
Multi-Agent DRiving Simulator
hsvgbkhgbv/master_thesis
hsvgbkhgbv/multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
hsvgbkhgbv/multiagent-particle-envs
hsvgbkhgbv/multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
hsvgbkhgbv/plato-research-dialogue-system
This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.
hsvgbkhgbv/poppy
:hibiscus: Population-Based Reinforcement Learning for Combinatorial Optimization
hsvgbkhgbv/PyBoy
Game Boy emulator written in Python
hsvgbkhgbv/pymarl
Python Multi-Agent Reinforcement Learning framework
hsvgbkhgbv/random-network-distillation-pytorch
Random Network Distillation pytorch
hsvgbkhgbv/safety-gym
Tools for accelerating safe exploration research.
hsvgbkhgbv/TextWorld
​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
hsvgbkhgbv/wqmix
Code for Weighted QMIX