sheng-han-zhang

Pinned Repositories

AIWolfPy
Language:Python1 0 00
alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook0 0 00
awesome-game-ai
Awesome Game AI materials of Multi-Agent Reinforcement Learning
0 0 00
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language:Python0 0 00
Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
Language:Python00
DeepRole
The code used to power DeepRole
Language:C++0 0 00
DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
Language:Python0 0 00
FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
Language:Python0 0 00
jynew
金庸群侠传3D重制版
Language:C#0 0 00
lykos
Werewolf, the popular detective/social party game (a theme of Mafia)
Language:Python0 0 00

sheng-han-zhang's Repositories

sheng-han-zhang/AIWolfPy
Language:Python1 0 00
sheng-han-zhang/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook0 0 00
sheng-han-zhang/awesome-game-ai
Awesome Game AI materials of Multi-Agent Reinforcement Learning
0 0 00
sheng-han-zhang/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language:Python0 0 00
sheng-han-zhang/Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
Language:Python00
sheng-han-zhang/DeepRole
The code used to power DeepRole
Language:C++0 0 00
sheng-han-zhang/DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
Language:Python0 0 00
sheng-han-zhang/FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
Language:Python0 0 00
sheng-han-zhang/jynew
金庸群侠传3D重制版
Language:C#0 0 00
sheng-han-zhang/lykos
Werewolf, the popular detective/social party game (a theme of Mafia)
Language:Python0 0 00
sheng-han-zhang/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
0 0 00
sheng-han-zhang/mcts
An implementation of Monte Carlo Tree Search in python
Language:Python0 0 00
sheng-han-zhang/mathematics_dataset
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
Language:Python
sheng-han-zhang/Megatron-LM
Ongoing research training transformer models at scale
Language:Python0 0
sheng-han-zhang/melee-ai
Super Smash Bros. Melee (SSBM) AI
Language:Python0 0
sheng-han-zhang/minerl_imitation_learning
sheng-han-zhang/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
Language:Python0 0
sheng-han-zhang/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
sheng-han-zhang/PyIMDB
In-memory database for python like a Redis(?). It's my learning sandbox of grpc.
sheng-han-zhang/rl-baselines3-zoo
A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
Language:Python0 0
sheng-han-zhang/sac-discrete-pytorch
sheng-han-zhang/sac-discrete.pytorch
A PyTorch implementation of SAC-Discrete.
Language:Python0 0
sheng-han-zhang/shakespeare
The Complete Works of William Shakespeare hosted at http://shakespeare.mit.edu/
sheng-han-zhang/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python0 0
sheng-han-zhang/tianshou
An elegant PyTorch deep reinforcement learning platform.
Language:Python0 0
sheng-han-zhang/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)