lz010866

lz010866's Stars

kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language: Python · 2.3k · 443
binary-husky/gpt_academic
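Provides a practical interaction interface for GPT/GLM and other large language models (LLMs), with particular focus on paper reading, polishing, and writing. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.
Language: Python · 64.4k · 8k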
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language: Python · 54.3k · 5.6k
PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
Language: Python · 3.3k · 820
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language: Python · 2.3k · 785
datawhalechina/llm-cookbook
An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large-model course series.
Language: Jupyter Notebook · 11.5k · 1.4k
morningsky/NTU-ReinforcementLearning-Notes
Study notes on deep reinforcement learning as taught by Professor Hung-yi Lee of National ** University.
Language: Python · 12223
datawhalechina/easy-rl
A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at: https://datawhalechina.github.io/easy-rl/
Language: Jupyter Notebook · 9.2k · 1.8k
lizhuoq/WeatherLearn
Implementation of the PyTorch version of the Weather Deep Learning Model Zoo.
Language: Python · 4713
zhaoshan2/pangu-pytorch
Weather forecast at 24-hour horizon
Language: Python · 144
Clarmy/pangu-weather-verify
Validation of the Pangu Weather Forecasting Model against Real-World Meteorological Observations
Language: Python · 5912
HaxyMoly/Pangu-Weather-ReadyToGo
Unofficial end-to-end demonstration of Huawei's Pangu Weather Model, covering input data preparation, forecasting, conversion of forecast results, and visualization.
Language: Python · 14929
198808xc/Pangu-Weather
An official implementation of Pangu-Weather
Language: Python · 1.1k · 197
zjunlp/KnowLM
An open-source knowledgeable large language model framework.
Language: Python · 1.2k · 123
google-research/football
Check out the new game server:
Language: Python · 3.3k · 1.3k
starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Language: Python · 1.4k · 279
Lizhi-sjtu/DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.
Language: Python · 1k · 174
TimeBreaker/Multi-Agent-Reinforcement-Learning-papers
Multi-Agent Reinforcement Learning (MARL) papers
19833
TimeBreaker/MARL-papers-with-code
Multi-Agent Reinforcement Learning (MARL) papers with code
29837
TimeBreaker/MARL-resources-collection
A Collection of Multi-Agent Reinforcement Learning (MARL) Resources
19610