Pinned Repositories
C-HMCNN
Code for paper: "Coherent Hierarchical Multi-Label Classification Networks"
DeepPath_GYM
code and docs for my EMNLP paper "DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning"
DILP-Core
Python and TensorFlow implementation of the paper "Learning Explanatory Rules from Noisy Data." Evans Richard and Edward Grefenstette. Journal of Artificial Intelligence Research 61 (2018): 1-64.
dilp-stratified-negation
EEE591_HW-
EPG
Code for the paper "Evolved Policy Gradients"
gym
A toolkit for developing and comparing reinforcement learning algorithms.
learn2learn_project
A PyTorch Library for Meta-learning Research
MILP-appendix
recsim
A Configurable Recommender Systems Simulation Platform
aniruddha123reinforcement's Repositories
aniruddha123reinforcement/MILP-appendix
aniruddha123reinforcement/C-HMCNN
Code for paper: "Coherent Hierarchical Multi-Label Classification Networks"
aniruddha123reinforcement/EEE591_HW-
aniruddha123reinforcement/RL4LMs
A modular RL library to fine-tune language models to human preferences
aniruddha123reinforcement/RL4LMs_2
aniruddha123reinforcement/DILP-Core
Python and TensorFlow implementation of the paper "Learning Explanatory Rules from Noisy Data." Evans Richard and Edward Grefenstette. Journal of Artificial Intelligence Research 61 (2018): 1-64.
aniruddha123reinforcement/dilp-stratified-negation
aniruddha123reinforcement/SQUIRE
EMNLP 22': SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning
aniruddha123reinforcement/SQUIRE_new
aniruddha123reinforcement/DeepPath_GYM
code and docs for my EMNLP paper "DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning"
aniruddha123reinforcement/gym
A toolkit for developing and comparing reinforcement learning algorithms.
aniruddha123reinforcement/learn2learn_project
A PyTorch Library for Meta-learning Research
aniruddha123reinforcement/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
aniruddha123reinforcement/recsim
A Configurable Recommender Systems Simulation Platform
aniruddha123reinforcement/stable_baselines
aniruddha123reinforcement/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
aniruddha123reinforcement/EPG
Code for the paper "Evolved Policy Gradients"