Pinned Repositories
acme.sh
A pure Unix shell script implementing ACME client protocol
CyberBattleSim
An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
MCTS
Python Implementations of Monte Carlo Tree Search
MOSS
An open-source tool-augmented conversational language model from Fudan University
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
pymarl
Python Multi-Agent Reinforcement Learning framework
Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
zzzzzzJg's Repositories
zzzzzzJg/MOSS
An open-source tool-augmented conversational language model from Fudan University
zzzzzzJg/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
zzzzzzJg/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
zzzzzzJg/acme.sh
A pure Unix shell script implementing ACME client protocol
zzzzzzJg/shooter3d_env
zzzzzzJg/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
zzzzzzJg/CyberBattleSim
An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.
zzzzzzJg/pymarl
Python Multi-Agent Reinforcement Learning framework
zzzzzzJg/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
zzzzzzJg/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
zzzzzzJg/MCTS
Python Implementations of Monte Carlo Tree Search