hsvgbkhgbv

I am a Senior Research Associate focusing on multi-agent reinforcement learning and ad hoc teamwork.

University of Bristol / University of ManchesterBristol

Pinned Repositories

MAPDN
This repository is for an open-source environment for multi-agent active voltage control on power distribution networks (MAPDN).
Language:Python211 3 4055
CIAO
This repository includes the implementation of the ICML 2024 paper titled "Open Ad Hoc Teamwork with Cooperative Game Theory."
Language:Python2 1 00
Matlab-Implement-HMM
This project implements HMM trained by EM and decoded by Viterbi.
Language:MATLAB10
Mean-field-Fictitious-Play-in-Potential-Games
Mean-field Fictitious Play in Potential Games
Language:Python60
PyPSA
PyPSA: Python for Power System Analysis
Language:Python2 1 00
shapley-q-learning
This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.
Language:Python42 2 013
Snore-Sound-Classification-by-Deep-Learning
This is the implementation for the paper: A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification.
Language:Python20 2 05
SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
Language:Python116 4 1042
Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
Language:Python10 3 11
HDNO
This is the source code for HDNO: a hierarchical model for task-oriented dialogue system.
Language:Python18 3 01

hsvgbkhgbv's Repositories

hsvgbkhgbv/SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
Language:Python116 4 1042
hsvgbkhgbv/shapley-q-learning
This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.
Language:Python42 2 013
hsvgbkhgbv/Snore-Sound-Classification-by-Deep-Learning
This is the implementation for the paper: A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification.
Language:Python20 2 05
hsvgbkhgbv/CIAO
This repository includes the implementation of the ICML 2024 paper titled "Open Ad Hoc Teamwork with Cooperative Game Theory."
Language:Python2 1 00
hsvgbkhgbv/PyPSA
PyPSA: Python for Power System Analysis
Language:Python2 1 00
hsvgbkhgbv/naht-dev
Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., Neurips 2024).
Language:Jupyter Notebook11
hsvgbkhgbv/comix
Language:Python1 0
hsvgbkhgbv/ConvLab
DSTC8 Track 1 Task 1 End-to-End Multi-Domain Dialog Challenge Result:
Language:Python2 0
hsvgbkhgbv/dcg
Language:Python1 0
hsvgbkhgbv/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
Language:Python0 0
hsvgbkhgbv/Fair-MARL
Cooperation and Fairness in Multi-Agent Reinforcement Learning
hsvgbkhgbv/graph-marl
Multi-Agent Reinforcement Learning in Graphs
hsvgbkhgbv/hsvgbkhgbv.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript
hsvgbkhgbv/InforMARL-dev
Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
hsvgbkhgbv/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language:Python0 0
hsvgbkhgbv/larl_trial
Language:Python
hsvgbkhgbv/lb-foraging
Level-Based Foraging (LBF): A multi-agent reinforcement learning environment
Language:Python0 0
hsvgbkhgbv/MADRaS
Multi-Agent DRiving Simulator
Language:Python1 0
hsvgbkhgbv/master_thesis
hsvgbkhgbv/multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
Language:Python2 0
hsvgbkhgbv/multiagent-particle-envs
Language:Python1 0
hsvgbkhgbv/multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
Language:Python2 0
hsvgbkhgbv/plato-research-dialogue-system
This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.
Language:Python2 0
hsvgbkhgbv/poppy
:hibiscus: Population-Based Reinforcement Learning for Combinatorial Optimization
Language:Python0 0
hsvgbkhgbv/PyBoy
Game Boy emulator written in Python
hsvgbkhgbv/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python2 01
hsvgbkhgbv/random-network-distillation-pytorch
Random Network Distillation pytorch
Language:Python1 0
hsvgbkhgbv/safety-gym
Tools for accelerating safe exploration research.
Language:Python1 0
hsvgbkhgbv/TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
Language:Python1 0
hsvgbkhgbv/wqmix
Code for Weighted QMIX