Pinned Repositories
AIWolfPy
alpaca-lora
Instruct-tune LLaMA on consumer hardware
awesome-game-ai
Awesome Game AI materials of Multi-Agent Reinforcement Learning
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
DeepRole
The code used to power DeepRole
DI-star
An artificial intelligence platform for StarCraft II with large-scale distributed training and grand-master agents.
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
jynew
3D remake of Heroes of Jin Yong (金庸群侠传)
lykos
Werewolf, the popular detective/social party game (a theme of Mafia)
sheng-han-zhang's Repositories
sheng-han-zhang/AIWolfPy
sheng-han-zhang/alpaca-lora
Instruct-tune LLaMA on consumer hardware
sheng-han-zhang/awesome-game-ai
Awesome Game AI materials of Multi-Agent Reinforcement Learning
sheng-han-zhang/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
sheng-han-zhang/Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
sheng-han-zhang/DeepRole
The code used to power DeepRole
sheng-han-zhang/DI-star
An artificial intelligence platform for StarCraft II with large-scale distributed training and grand-master agents.
sheng-han-zhang/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
sheng-han-zhang/jynew
3D remake of Heroes of Jin Yong (金庸群侠传)
sheng-han-zhang/lykos
Werewolf, the popular detective/social party game (a theme of Mafia)
sheng-han-zhang/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
sheng-han-zhang/mcts
An implementation of Monte Carlo Tree Search in Python
sheng-han-zhang/mathematics_dataset
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
sheng-han-zhang/Megatron-LM
Ongoing research training transformer models at scale
sheng-han-zhang/melee-ai
Super Smash Bros. Melee (SSBM) AI
sheng-han-zhang/minerl_imitation_learning
sheng-han-zhang/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
sheng-han-zhang/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
sheng-han-zhang/PyIMDB
An in-memory database for Python, similar to Redis(?). It's my learning sandbox for gRPC.
sheng-han-zhang/rl-baselines3-zoo
A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
sheng-han-zhang/sac-discrete-pytorch
sheng-han-zhang/sac-discrete.pytorch
A PyTorch implementation of SAC-Discrete.
sheng-han-zhang/shakespeare
The Complete Works of William Shakespeare hosted at http://shakespeare.mit.edu/
sheng-han-zhang/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
sheng-han-zhang/tianshou
An elegant PyTorch deep reinforcement learning platform.
sheng-han-zhang/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)