DavidWang527's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
sdmg15/Best-websites-a-programmer-should-visit
🔗 Some useful websites for programmers.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
yoheinakajima/babyagi
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
tuteng/Best-websites-a-programmer-should-visit-zh
Chinese version of the best websites a programmer should visit.
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
PKU-YuanGroup/ChatLaw
ChatLaw: A powerful LLM tailored for the Chinese legal domain.
rtqichen/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
circlestarzero/EX-chatGPT
Let ChatGPT truly learn how to go online and call APIs! 'EX-ChatGPT' can rival and even surpass NewBing
openai/multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
ORDINAND/The-Art-of-Asking-ChatGPT-for-High-Quality-Answers-A-complete-Guide-to-Prompt-Engineering-Technique
Tips for asking ChatGPT questions.
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
XinJingHao/Deep-Reinforcement-Learning-Algorithms-with-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular DRL Algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Bigpig4396/Multi-Agent-Reinforcement-Learning-Environment
Hello, I pushed some Python environments for Multi-Agent Reinforcement Learning.
eleurent/rl-agents
Implementations of Reinforcement Learning and Planning algorithms
shifujun/UESTCthesis
LaTeX template for graduation theses at the University of Electronic Science and Technology of China (UESTC).
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
kaixindelele/ChatOpenReview
A crowdfunded open-source project: use OpenReview's high-quality review data to fine-tune a professional LLM for writing reviews and review responses.
binary-husky/unreal-map
Multiagent research environment toolbox based on Unreal Engine
hzeyuan/OpenGPTS
OpenGPTs: Powerful GPTs Copilot | Powerful GPTs browser extension | multi-window | batch conversations | ChatGPT 3.5 | ChatGPT 4.0
binary-husky/hmp2g
Multiagent Reinforcement Learning Research Project
sumitsk/marl_transfer
Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)
BIT-aerial-robotics/AquaML
binary-husky/ChatPaper
Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow: use ChatGPT for paper summarization, polishing, reviewing, and review responses.
Haichao-Zhang/PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
oxwhirl/comix
binary-husky/Chinese-ChatLLaMA
Chinese LLaMA base model; Chinese ChatLLaMA dialogue model; NLP pre-training and instruction fine-tuning datasets.