AndyYue1893

Stay hungry, stay foolish

AndyYue1893's Stars

floodsung/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
Language:Python38k 2.1k 537.3k
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Python37k 429 1.6k3.2k
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Language:Python23.6k 382 1782k
karpathy/llama2.c
Inference Llama 2 in one file of pure C
Language:C17.2k 190 2202.1k
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:Python13.5k 555 994.8k
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python8.8k 63 1.5k1.7k
aikorea/awesome-rl
Reinforcement learning resources curated
8.8k 441 121.8k
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Language:Python8.8k 96 1812k
mymusise/ChatGLM-Tuning
基于ChatGLM-6B + LoRA的Fintune方案
Language:Python3.7k 31 247440
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k 67 229829
huawei-noah/HEBO
Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab
Language:Jupyter Notebook3.2k 340 49583
microsoft/PromptCraft-Robotics
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
Language:Python1.8k 42 13199
JSBSim-Team/jsbsim
An open source flight dynamics & control software library
Language:C++1.3k 55 331448
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.3k 18 84119
utiasDSL/gym-pybullet-drones
PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control
Language:Python1.2k 16 186350
uzh-rpg/flightmare
An Open Flexible Quadrotor Simulator
Language:C++991 32 168343
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python912 38 103130
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Language:Python884 10 153142
m-lundberg/simple-pid
A simple and easy to use PID controller in Python
Language:Python768 20 55209
PKU-Alignment/safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Language:Python382 9 2553
liuqh16/CloseAirCombat
An environment based on JSBSIM aimed at one-to-one close air combat.
Language:Python250 5 3978
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
221 8 39
AGI-Edgerunners/LLM-Optimizers-Papers
Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.
213 6 218
Gor-Ren/gym-jsbsim
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model
Language:Python172 13 1585
AndyYue1893/COVID-19-SEIR-LSTM
本项目实现2019新型冠状病毒肺炎预测，分别采用经典传染病动力学模型SEIR和LSTM神经网络实现，通过控制模型参数来改变干预程度，体现防控的意义。
Language:Python105 3 326
maohangyu/TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
Language:Python53 4 24
maohangyu/marl_demo
demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention Multi-Agent DDPG) and NCC-MARL (Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning).
Language:Python43 1 05
Theohhhu/CloseAirCombat_baseline
An environment based on JSBSIM aimed at one-to-one close air combat.
Language:Python8 0 01
heronsystems/gym-jsbsim-f16
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model
Language:Jupyter Notebook6 2 03
PKU-MARL/MARLlib
This code base enables multi-agent RL in the RLlib
Language:Python5 0 01

AndyYue1893

AndyYue1893's Stars

floodsung/Deep-Learning-Papers-Reading-Roadmap

LAION-AI/Open-Assistant

microsoft/JARVIS

karpathy/llama2.c

ShangtongZhang/reinforcement-learning-an-introduction

DLR-RM/stable-baselines3

aikorea/awesome-rl

jadore801120/attention-is-all-you-need-pytorch

mymusise/ChatGLM-Tuning

ikostrikov/pytorch-a2c-ppo-acktr-gail

huawei-noah/HEBO

microsoft/PromptCraft-Robotics

JSBSim-Team/jsbsim

PKU-Alignment/safe-rlhf

utiasDSL/gym-pybullet-drones

uzh-rpg/flightmare

PKU-Alignment/omnisafe

Replicable-MARL/MARLlib

m-lundberg/simple-pid

PKU-Alignment/safety-gymnasium

liuqh16/CloseAirCombat

floodsung/LLM-with-RL-papers

AGI-Edgerunners/LLM-Optimizers-Papers

Gor-Ren/gym-jsbsim

AndyYue1893/COVID-19-SEIR-LSTM

maohangyu/TIT_open_source

maohangyu/marl_demo

Theohhhu/CloseAirCombat_baseline

heronsystems/gym-jsbsim-f16

PKU-MARL/MARLlib