kiankyars's Stars
ossu/computer-science
🎓 Path to a free self-taught education in Computer Science!
pytorch/examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
google-deepmind/alphafold
Open source code for AlphaFold 2.
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
MorvanZhou/PyTorch-Tutorial
Build your neural network easily and quickly; Morvan Python tutorials in Chinese
vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
udacity/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
jacobhilton/deep_learning_curriculum
Language model alignment-focused deep learning curriculum
philtabor/Youtube-Code-Repository
Repository for most of the code from my YouTube channel
MorvanZhou/pytorch-A3C
Simple A3C implementation with PyTorch + multiprocessing
qgallouedec/panda-gym
A set of robotic environments based on the PyBullet physics engine and Gymnasium.
Fulgurus/candy-machine-v2-responsive-ui
Solana Candy Machine V2 with a production-ready, easy-to-customize responsive UI.
jachiam/rl-intro
jihoonog/School-Notes-Public
This is the public repository of all my school notes from the University of Alberta (and some from the University of Lethbridge)
yfletberliac/rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
stefanbo92/A3C-Continuous
TensorFlow implementation of the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm for continuous action spaces
orrivlin/MountainCar_DQN_RND
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
aarctan/schedubuddy-web
open-spaced-repetition/fsrs4remnote
A modern RemNote scheduler plugin based on the Free Spaced Repetition Scheduler (FSRS) algorithm
gouxiangchen/ac-ppo
Actor-Critic and OpenAI clipped PPO in the Gym CartPole-v0 and Pendulum-v0 environments
jinglescode/reinforcement-learning-tic-tac-toe
A reinforcement learning algorithm for agents to learn tic-tac-toe using the value function.
simonbogh/rl_panda_gym_pybullet_example
OpenAI Gym, PyBullet, panda-gym example
eyalbd2/Deep_RL_Course
hugomarins/fsrs4remnoteFork
A modern RemNote scheduler plugin based on the Free Spaced Repetition Scheduler (FSRS) algorithm
michaelfromyeg/midtermr
A lil' project to generate UBC MATH midterm and final exams. For HackCamp 2022.
rrokhit/uber-lyft
Uber vs Lyft Fare Comparison