Pinned Repositories
a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
crVAE
[WACV2018] Channel-Recurrent Autoencoding for Image Modeling
curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.
eccv16_attr2img
Torch implementing of attribute2image project
ecml19_sa3c
[ECML2019] Stochastic Actor Critic Methods
ELF
An End-To-End, Lightweight and Flexible Platform for Game Research
flare
Reinforcement Learning with Latent Flow
gym-minigrid
Minimalistic gridworld environment for OpenAI Gym
wacv19_acVAE
[WACV2019] Attentive Attribute-Conditioned Channel-Recurrent Autoencoding
WendyShang's Repositories
WendyShang/flare
Reinforcement Learning with Latent Flow
WendyShang/crVAE
[WACV2018] Channel-Recurrent Autoencoding for Image Modeling
WendyShang/dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.
WendyShang/ecml19_sa3c
[ECML2019] Stochastic Actor Critic Methods
WendyShang/wacv19_acVAE
[WACV2019] Attentive Attribute-Conditioned Channel-Recurrent Autoencoding
WendyShang/a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
WendyShang/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
WendyShang/eccv16_attr2img
Torch implementing of attribute2image project
WendyShang/ELF
An End-To-End, Lightweight and Flexible Platform for Game Research
WendyShang/gym-minigrid
Minimalistic gridworld environment for OpenAI Gym
WendyShang/maxtext
A simple, performant and scalable Jax LLM!
WendyShang/miniF2F-1
Formal to Formal Mathematics Benchmark
WendyShang/mistral_humaneval_script
a simple script to evaluate mistral API human eval models
WendyShang/Neural-Photo-Editor
A simple interface for editing natural photos with generative neural networks.
WendyShang/pyclient_cppbatcher
This is a simple, minimal example of requesting from python client and process request in batch on cpp side.
WendyShang/rad
RAD: Reinforcement Learning with Augmented Data
WendyShang/ray_on_slurm
WendyShang/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
WendyShang/video_stat_dyna