vivianisvivian's Stars
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
NLP-LOVE/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
ljpzzz/machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
tomlooman/ActionRoguelike
Third-person Action Roguelike made in Unreal Engine C++. Project for Unreal Engine C++ Course & Stanford University
maicss/PyQt-Chinese-tutorial
PyQt6中文教程
NeuronDance/DeepRL
Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone
YunYang1994/TensorFlow2.0-Examples
🙄 Difficult algorithm, Simple code.
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
qqiang00/Reinforce
Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments
RITCHIEHuang/DeepRL_Algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
wangzhaode/llm-export
llm-export can export llm model to onnx.
xtma/pytorch_car_caring
Reinforcement Learning for Gym CarRacing-v0 with PyTorch
adik993/ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
wangshusen/deep-rl
Collection of Deep Reinforcement Learning algorithms
XinJingHao/TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
eivindeb/fixed-wing-gym
lingjiameng/FGAutopilot
Flightgear autopilot with DRL using Python, (Contain Python Interface for FlightGear)
krisgry/x8
yufeiwang63/RLlab
pytorch implementation of DQN, NAF, DDPG
opt12/Markov-Pilot
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model