Pinned Repositories
cgan_for_edges_to_shoes
Edges to Shoes with cGAN (Pix2Pix)
logos_parser
Parser for Logos from www.brandsoftheworld.com
mbrl_multitasking
Model-Based RL Multi-Tasking with ReLAx
music_album_captioning
Music Album Style Image Quoting with Transformers
parallel_ppo
Speeding Up PPO with Parallel Sampling
president_speech
Finetuning GPT2 to answer questions like Vladimir Putin
rainbow_for_2048
Playing 2048 with Rainbow agent
relax
ReLAx - Reinforcement Learning Applications Library
relax_mbpo_example
Example MBPO implementation with ReLAx
rnns_for_pomdp
Recurrent Policies for Handling Partially Observable Environments
nslyubaykin's Repositories
nslyubaykin/relax
ReLAx - Reinforcement Learning Applications Library
nslyubaykin/mbrl_multitasking
Model-Based RL Multi-Tasking with ReLAx
nslyubaykin/rainbow_for_2048
Playing 2048 with Rainbow agent
nslyubaykin/relax_mbpo_example
Example MBPO implementation with ReLAx
nslyubaykin/rnns_for_pomdp
Recurrent Policies for Handling Partially Observable Environments
nslyubaykin/music_album_captioning
Music Album Style Image Quoting with Transformers
nslyubaykin/parallel_ppo
Speeding Up PPO with Parallel Sampling
nslyubaykin/president_speech
Finetuning GPT2 to answer questions like Vladimir Putin
nslyubaykin/low_res_logos_gan
DC GAN for low resolution logo generation
nslyubaykin/nstep_td3
Multistep TD3 for locomotion
nslyubaykin/ppo_with_dqn_critic
Training PPO with DQN as a critic
nslyubaykin/prioritized_ddqn
Prioritized DDQN example with ReLAx
nslyubaykin/relax_a2c_example
Example A2C implementation with ReLAx
nslyubaykin/relax_categorical_dqn_example
Example Categorical DQN implementation with ReLAx
nslyubaykin/relax_cem_example
Example CEM implementation with ReLAx
nslyubaykin/relax_ddpg_example
Example DDPG implementation with ReLAx
nslyubaykin/relax_double_dqn_example
Example Double DQN implementation with ReLAx
nslyubaykin/relax_dqn_example
Example DQN implementation with ReLAx
nslyubaykin/relax_dueling_dqn_example
Example Dueling DQN implementation with ReLAx
nslyubaykin/relax_dyna_q_example
Example DYNA-Q implementation with ReLAx
nslyubaykin/relax_frwr_example
Example FRWR (PDDM) implementation with ReLAx
nslyubaykin/relax_noisy_dqn_example
Example Noisy DQN implementation with ReLAx
nslyubaykin/relax_ppo_example
Example PPO implementation with ReLAx
nslyubaykin/relax_rainbow_dqn_example
Example Rainbow DQN implementation with ReLAx
nslyubaykin/relax_random_shooting_example
Example Random Shooting implementation with ReLAx
nslyubaykin/relax_sac_example
Example SAC implementation with ReLAx
nslyubaykin/relax_td3_example
Example TD3 implementation with ReLAx
nslyubaykin/relax_trpo_example
Example TRPO implementation with ReLAx
nslyubaykin/relax_vpg_example
Example VPG implementation with ReLAx
nslyubaykin/trpo_schedule_kl
Scheduling TRPO's KL Divergence Constraint