Pinned Repositories
alfred-aws-icons
Alfred Workflow for quickly pasting AWS architecture icons onto PowerPoint.
discor.pytorch
PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.
fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
rljax
A collection of RL algorithms written in JAX.
rltorch
A simple framework for distributed reinforcement learning in PyTorch.
sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).
soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
vae.pytorch
PyTorch Implementation of Deep Feature Consistent Variational Autoencoder.
toshikwa's Repositories
toshikwa/sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
toshikwa/gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
toshikwa/fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
toshikwa/soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
toshikwa/rljax
A collection of RL algorithms written in JAX.
toshikwa/slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).
toshikwa/discor.pytorch
PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.
toshikwa/alfred-aws-icons
Alfred Workflow for quickly pasting AWS architecture icons onto PowerPoint.
toshikwa/rltorch
A simple framework for distributed reinforcement learning in PyTorch.
toshikwa/vae.pytorch
PyTorch Implementation of Deep Feature Consistent Variational Autoencoder.
toshikwa/simple-rl.pytorch
Simple implementation of model-free RL algorithms written in PyTorch.
toshikwa/wappo.pytorch
PyTorch implementation of Wasserstein Adversarial Proximal Policy Optimization(WAPPO).
toshikwa/gec-app
This project contains frontend/backend application code and infrastructure for grammatical error correction.
toshikwa/slac-discrete.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic(SLAC) extended for discrete action settings.
toshikwa/dmm-schedule-checker
DMM schedule checker continuously monitors the schedule of your favorite teachers, and notifies via LINE whenever new slots are available.
toshikwa/sagemaker-tutorial
Amazon SageMaker tutorial
toshikwa/ssm-enforcement-tool
This project contains a set of infrastructure implemented in Terraform to monitor your "not-managed-by-SSM" instances accross all regions.
toshikwa/alfred-aws-console-services-workflow
A powerful workflow for quickly opening up AWS Console Services in your browser or searching for entities within them.
toshikwa/MinAtar
toshikwa/rl-tutorials