Pinned Repositories
minirts
We release the dataset collected for our research, code that implements the neural network models described in the paper, scripts to reproduce all of our results, and a visualization tool for the dataset.
off-belief-learning
Implementation of the Off Belief Learning algorithm.
a2c
bottom-up-attention-vqa
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
dem
Building deep energy models
ibrl
instruct-rl
jax-vs-pytorch
monometis
rainbow
A PyTorch implementation of the Rainbow DQN agent.
hengyuan-hu's Repositories
hengyuan-hu/bottom-up-attention-vqa
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
hengyuan-hu/rainbow
A PyTorch implementation of the Rainbow DQN agent.
hengyuan-hu/a2c
hengyuan-hu/ibrl
hengyuan-hu/jax-vs-pytorch
hengyuan-hu/instruct-rl
hengyuan-hu/dem
Building deep energy models
hengyuan-hu/monometis
hengyuan-hu/nogil-pybind
hengyuan-hu/pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
hengyuan-hu/hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
hengyuan-hu/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
hengyuan-hu/optim-weight-norm
hengyuan-hu/vqa
hengyuan-hu/dqn-hw
hengyuan-hu/cs236project
hengyuan-hu/custom-keras
hengyuan-hu/dm_env_rpc
A networking protocol for agent-environment communication
hengyuan-hu/emacs-dot
hengyuan-hu/gadem
hengyuan-hu/hanabi-live
A web server that allows people to play Hanabi, a cooperative card game of logic and reasoning.
hengyuan-hu/hanabi-live-bot
An example bot for the Hanabi Live website written in Python
hengyuan-hu/hengyuan-hu.github.io
hengyuan-hu/ibrl-web
Website for Imitation Bootstrapped Reinforcement Learning
hengyuan-hu/LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
hengyuan-hu/minimalm
hengyuan-hu/net-trim
hengyuan-hu/nogil-hanabi
hengyuan-hu/pybind11
Seamless operability between C++11 and Python
hengyuan-hu/robot-lightning
Robot Controllers for Research Lightning