zzzzzzJg

Pinned Repositories

acme.sh
A pure Unix shell script implementing ACME client protocol
Language:Shell00
CyberBattleSim
An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.
Language:Jupyter Notebook00
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python00
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python00
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language:Python00
MCTS
Python Implementations of Monte Carlo Tree Search
Language:Python00
MOSS
An open-source tool-augmented conversational language model from Fudan University
Language:Python00
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python00
pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python00
Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Language:Python00

zzzzzzJg's Repositories

zzzzzzJg/MOSS
An open-source tool-augmented conversational language model from Fudan University
zzzzzzJg/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
zzzzzzJg/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
zzzzzzJg/acme.sh
A pure Unix shell script implementing ACME client protocol
zzzzzzJg/shooter3d_env
zzzzzzJg/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
zzzzzzJg/CyberBattleSim
An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.
zzzzzzJg/pymarl
Python Multi-Agent Reinforcement Learning framework
zzzzzzJg/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
zzzzzzJg/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
zzzzzzJg/MCTS
Python Implementations of Monte Carlo Tree Search