Pinned Repositories
RandomizedValueFunctions
Randomized Value Functions via Multiplicative Normalizing Flows
2020algorithm_study
charm
Charm: A Framework for Rapidly Prototyping Cryptosystems
Class-Projects
University of Maryland Class Projects
Cola
협업합시다 :)
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
recsim_ng-forked
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
TempoRL
twoyak_back
soonjune's Repositories
soonjune/2020algorithm_study
soonjune/charm
Charm: A Framework for Rapidly Prototyping Cryptosystems
soonjune/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
soonjune/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
soonjune/recsim_ng-forked
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
soonjune/RL_project
nchain
soonjune/TempoRL
soonjune/twoyak_back
soonjune/d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
soonjune/Deep-Q-Learning-Paper-To-Code
soonjune/DQN_tutorial
soonjune/github-slideshow
A robot powered training repository :robot:
soonjune/hill-cipher
Sviluppo del cifrario di Hill. Permette all'utente di cifrare e decifrare con il cifrario di Hill e di forzare un ciphertext tramite l'attacco known plaintext.
soonjune/klaw_case_collector
Korean precedent collector
soonjune/movielens_feat_extraction
User / Item feature extraction from MovieLens 100K dataset
soonjune/nbconvert
Jupyter Notebook Conversion
soonjune/nchain_temporally_extended
soonjune/Neural-Linear-Bandits-with-Likelihood-Matching
soonjune/option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
soonjune/os341
operating systems HW
soonjune/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
soonjune/q-trader_test_for_samsung
soonjune/ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
soonjune/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
soonjune/rl_trader
DQN for stock day trading
soonjune/soonjune.github.io
soonjune/SparceReward
soonjune/spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
soonjune/SWAG_project
for experimenting swag
soonjune/visualization