Pinned Repositories
BMIL
PyTorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)
Distr-A3C
Asynchronous Advantage Actor-Critic (A3C) training over a cluster using distributed TensorFlow
GA3C-DeepNavigation
TensorFlow implementation of the DeepMind paper "Learning to Navigate in Complex Environments"
GuidanceRewards
PyTorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
IE598_RL
Reinforcement Learning assignments for IE598 (Fall'17)
QDAgents
PyTorch code for "Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity" (CoRL 2020)
RegAlloc
Chaitin-Briggs register-allocation algorithm (LLVM back-end)
RL-Indirect-imitation
PyTorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
SelfImitationDiverse
TensorFlow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
Zorro_SMPC
tgangwani's Repositories
tgangwani/GA3C-DeepNavigation
TensorFlow implementation of the DeepMind paper "Learning to Navigate in Complex Environments"
tgangwani/RL-Indirect-imitation
PyTorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
tgangwani/SelfImitationDiverse
TensorFlow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
tgangwani/BMIL
PyTorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)
tgangwani/GuidanceRewards
PyTorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
tgangwani/RegAlloc
Chaitin-Briggs register-allocation algorithm (LLVM back-end)
tgangwani/Distr-A3C
Asynchronous Advantage Actor-Critic (A3C) training over a cluster using distributed TensorFlow
tgangwani/QDAgents
PyTorch code for "Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity" (CoRL 2020)
tgangwani/IE598_RL
Reinforcement Learning assignments for IE598 (Fall'17)
tgangwani/Zorro_SMPC
tgangwani/AILO
tgangwani/FaultRecovery
(Graph) Fault Tolerance for Apache Hama
tgangwani/PartialRedundancyElimination
Partial Redundancy Elimination Pass in LLVM
tgangwani/DynReconfig
Machine-learning-aided dynamic hardware reconfiguration
tgangwani/Ethereum-PPV-contract
Smart contract for the Ethereum platform in Solidity
tgangwani/ethereumlab
tgangwani/GraphColoring
Parallel Graph Coloring using Charm++
tgangwani/Insight-data-challenge
Insight Data Challenge 2016
tgangwani/Miscellaneous
A mixed bunch of ML code, web scripts, etc.