AndyYue1893's Stars
floodsung/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
karpathy/llama2.c
Inference Llama 2 in one file of pure C
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
aikorea/awesome-rl
Reinforcement learning resources curated
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
mymusise/ChatGLM-Tuning
基于ChatGLM-6B + LoRA的Fintune方案
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
huawei-noah/HEBO
Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab
microsoft/PromptCraft-Robotics
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
JSBSim-Team/jsbsim
An open source flight dynamics & control software library
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
utiasDSL/gym-pybullet-drones
PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control
uzh-rpg/flightmare
An Open Flexible Quadrotor Simulator
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
m-lundberg/simple-pid
A simple and easy to use PID controller in Python
PKU-Alignment/safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
liuqh16/CloseAirCombat
An environment based on JSBSIM aimed at one-to-one close air combat.
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
AGI-Edgerunners/LLM-Optimizers-Papers
Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.
Gor-Ren/gym-jsbsim
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model
AndyYue1893/COVID-19-SEIR-LSTM
本项目实现2019新型冠状病毒肺炎预测,分别采用经典传染病动力学模型SEIR和LSTM神经网络实现,通过控制模型参数来改变干预程度,体现防控的意义。
maohangyu/TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
maohangyu/marl_demo
demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention Multi-Agent DDPG) and NCC-MARL (Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning).
Theohhhu/CloseAirCombat_baseline
An environment based on JSBSIM aimed at one-to-one close air combat.
heronsystems/gym-jsbsim-f16
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model
PKU-MARL/MARLlib
This code base enables multi-agent RL in the RLlib