keerthanss's Stars
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
google-deepmind/dnc
A TensorFlow implementation of the Differentiable Neural Computer.
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
nammayatri/nammayatri
A Direct-to-Driver open mobility platform powering the next-generation of mobility applications in India.
Eric-mingjie/rethinking-network-pruning
Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)
wowthemesnet/mediumish-theme-jekyll
Jekyll Template - Mediumish
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
williamFalcon/DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
clvrai/awesome-rl-envs
saltudelft/ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
sgossner/VSCO-2-CE
An open-source orchestral library
voidful/TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
keiohta/tf2rl
TensorFlow2 Reinforcement Learning
clojurians-org/haskell-ebook
TianhongDai/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
google-research/realworldrl_suite
Real-World RL Benchmark Suite
kindredresearch/SenseAct
SenseAct: A computational framework for developing real-world robot learning tasks
mbchang/dynamics
A Compositional Object-Based Approach to Learning Physical Dynamics
facebookresearch/impact-driven-exploration
impact-driven-exploration
bgavran/DNC
Implementation of the Differentiable Neural Computer in Tensorflow
grantsrb/Gym-Snake
An OpenAI gym environment made for RL
kachayev/pyage2
"Age of Empires II" Learning Environment
anshul3899/SPSA-Net
A numpy implementation of SPSA for optimizing neural networks
AkshayGurudath/Checkmate-with-Rook
This repository is for using DRL to checkmate a king with the help of a rook within 50 moves.