keerthanss

Bengaluru, India

keerthanss's Stars

pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python84.5k 1.7k 46.9k22.8k
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python15.8k 645 8504.9k
google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language:C++4.3k 107 559937
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
Language:C++3.6k 181 1.3k1k
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k 66 229826
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Language:Python2.9k 49 40464
google-deepmind/dnc
A TensorFlow implementation of the Differentiable Neural Computer.
Language:Python2.5k 165 37441
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python2.2k 24 59190
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python1.7k 9 61351
nammayatri/nammayatri
A Direct-to-Driver open mobility platform powering the next-generation of mobility applications in India.
Language:PureScript1.6k 14 2.6k181
Eric-mingjie/rethinking-network-pruning
Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)
Language:Python1.5k 33 54293
wowthemesnet/mediumish-theme-jekyll
Jekyll Template - Mediumish
Language:JavaScript1.3k 21 1011.5k
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language:Python1.1k 26 36189
williamFalcon/DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
1.1k 50 1122
clvrai/awesome-rl-envs
1.1k 31 484
saltudelft/ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
686 44 191
sgossner/VSCO-2-CE
An open-source orchestral library
563 39 775
voidful/TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Language:Python544 11 2360
keiohta/tf2rl
TensorFlow2 Reinforcement Learning
Language:Python467 18 120103
clojurians-org/haskell-ebook
402 23 081
TianhongDai/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Language:Python400 6 2975
google-research/realworldrl_suite
Real-World RL Benchmark Suite
Language:Python347 14 429
kindredresearch/SenseAct
SenseAct: A computational framework for developing real-world robot learning tasks
Language:Python216 11 2942
mbchang/dynamics
A Compositional Object-Based Approach to Learning Physical Dynamics
Language:Lua169 10 417
facebookresearch/impact-driven-exploration
impact-driven-exploration
Language:Python128 9 727
bgavran/DNC
Implementation of the Differentiable Neural Computer in Tensorflow
Language:Python117 12 818
grantsrb/Gym-Snake
An OpenAI gym environment made for RL
Language:Python65 2 329
kachayev/pyage2
"Age of Empires II" Learning Environment
Language:Python65 7 18
anshul3899/SPSA-Net
A numpy implementation of SPSA for optimizing neural networks
Language:Python6 1 10
AkshayGurudath/Checkmate-with-Rook
This repository is for using DRL to checkmate a king with the help of a rook within 50 moves.
Language:Python12

keerthanss

keerthanss's Stars

pytorch/pytorch

openai/baselines

google-deepmind/open_spiel

oneapi-src/oneDNN

ikostrikov/pytorch-a2c-ppo-acktr-gail

seungeunrho/minimalRL

google-deepmind/dnc

allenai/RL4LMs

nikhilbarhate99/PPO-PyTorch

nammayatri/nammayatri

Eric-mingjie/rethinking-network-pruning

wowthemesnet/mediumish-theme-jekyll

Khrylx/PyTorch-RL

williamFalcon/DeepRLHacks

clvrai/awesome-rl-envs

saltudelft/ml4se

sgossner/VSCO-2-CE

voidful/TextRL

keiohta/tf2rl

clojurians-org/haskell-ebook

TianhongDai/hindsight-experience-replay

google-research/realworldrl_suite

kindredresearch/SenseAct

mbchang/dynamics

facebookresearch/impact-driven-exploration

bgavran/DNC

grantsrb/Gym-Snake

kachayev/pyage2

anshul3899/SPSA-Net

AkshayGurudath/Checkmate-with-Rook