tgangwani

Ph.D. student at UIUC.

Pinned Repositories

BMIL
Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)
Language:Python18 3 35
Distr-A3C
Asynchronous Advantage Actor-Critic (A3C) training over a cluster using distributed TensorFlow
Language:Python10 3 02
GA3C-DeepNavigation
Tensorflow implementation of DeepMind paper - "Learning to Navigate in Complex Environments"
Language:Python63 11 025
GuidanceRewards
Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
Language:Python12 3 12
IE598_RL
Reinforcement Learning assignments for IE598 (Fall'17)
Language:Python6 3 012
QDAgents
Pytorch code for "Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity", (CoRL 2020)
Language:Python7 2 11
RegAlloc
Chaitin-Briggs register-allocation algorithm (LLVM back-end)
Language:C++11 3 02
RL-Indirect-imitation
Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
Language:Python19 4 44
SelfImitationDiverse
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
Language:Python19 2 02
Zorro_SMPC
Language:Python2 4 00

tgangwani's Repositories

tgangwani/GA3C-DeepNavigation
Tensorflow implementation of DeepMind paper - "Learning to Navigate in Complex Environments"
Language:Python63 11 025
tgangwani/RL-Indirect-imitation
Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
Language:Python19 4 44
tgangwani/SelfImitationDiverse
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
Language:Python19 2 02
tgangwani/BMIL
Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)
Language:Python18 3 35
tgangwani/GuidanceRewards
Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
Language:Python12 3 12
tgangwani/RegAlloc
Chaitin-Briggs register-allocation algorithm (LLVM back-end)
Language:C++11 3 02
tgangwani/Distr-A3C
Asynchronous Advantage Actor-Critic (A3C) training over a cluster using distributed TensorFlow
Language:Python10 3 02
tgangwani/QDAgents
Pytorch code for "Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity", (CoRL 2020)
Language:Python7 2 11
tgangwani/IE598_RL
Reinforcement Learning assignments for IE598 (Fall'17)
Language:Python6 3 012
tgangwani/Zorro_SMPC
Language:Python2 4 00
tgangwani/AILO
1 3 2
tgangwani/FaultRecovery
(Graph) Fault Tolerance for Apache Hama
Language:Java1 2 0
tgangwani/PartialRedundancyElimination
Partial Redundancy Elimination Pass in LLVM
Language:TeX1 2 0
tgangwani/DynReconfig
Machine-Learning aided dynamic hardware reconfiguration
Language:Python2 0
tgangwani/Ethereum-PPV-contract
Smart contract for the Ethereum platform in Solidity
Language:Python2 0
tgangwani/ethereumlab
Language:TeX2 0
tgangwani/GraphColoring
Parallel Graph Coloring using Charm++
Language:C3 05
tgangwani/Insight-data-challenge
Insight Data Challenge 2016
Language:C++2 0
tgangwani/Miscellaneous
Mix bunch of ML codes, web scripts etc.
Language:Python2 0

tgangwani

Pinned Repositories

BMIL

Distr-A3C

GA3C-DeepNavigation

GuidanceRewards

IE598_RL

QDAgents

RegAlloc

RL-Indirect-imitation

SelfImitationDiverse

Zorro_SMPC

tgangwani's Repositories

tgangwani/GA3C-DeepNavigation

tgangwani/RL-Indirect-imitation

tgangwani/SelfImitationDiverse

tgangwani/BMIL

tgangwani/GuidanceRewards

tgangwani/RegAlloc

tgangwani/Distr-A3C

tgangwani/QDAgents

tgangwani/IE598_RL

tgangwani/Zorro_SMPC

tgangwani/AILO

tgangwani/FaultRecovery

tgangwani/PartialRedundancyElimination

tgangwani/DynReconfig

tgangwani/Ethereum-PPV-contract

tgangwani/ethereumlab

tgangwani/GraphColoring

tgangwani/Insight-data-challenge

tgangwani/Miscellaneous