soonjune

Pinned Repositories

RandomizedValueFunctions
Randomized Value Functions via Multiplicative Normalizing Flows
Language:Python18 6 310
2020algorithm_study
Language:Python0 0 00
charm
Charm: A Framework for Rapidly Prototyping Cryptosystems
Language:C0 0 00
Class-Projects
University of Maryland Class Projects
Language:Java0 0 00
Cola
협업합시다 :)
Language:Ruby0 0 00
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python0 0 00
pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:Python00
recsim_ng-forked
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Language:Jupyter Notebook0 0 00
TempoRL
Language:Python0 0 00
twoyak_back
Language:Ruby0 0 00

soonjune's Repositories

soonjune/2020algorithm_study
Language:Python0 0 00
soonjune/charm
Charm: A Framework for Rapidly Prototyping Cryptosystems
Language:C0 0 00
soonjune/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python0 0 00
soonjune/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:Python00
soonjune/recsim_ng-forked
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Language:Jupyter Notebook0 0 00
soonjune/RL_project
nchain
Language:Python00
soonjune/TempoRL
Language:Python0 0 00
soonjune/twoyak_back
Language:Ruby0 0 00
soonjune/d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
Language:Jupyter Notebook
soonjune/Deep-Q-Learning-Paper-To-Code
soonjune/DQN_tutorial
Language:Python
soonjune/github-slideshow
A robot powered training repository :robot:
Language:HTML
soonjune/hill-cipher
Sviluppo del cifrario di Hill. Permette all'utente di cifrare e decifrare con il cifrario di Hill e di forzare un ciphertext tramite l'attacco known plaintext.
soonjune/klaw_case_collector
Korean precedent collector
Language:Python
soonjune/movielens_feat_extraction
User / Item feature extraction from MovieLens 100K dataset
Language:Jupyter Notebook
soonjune/nbconvert
Jupyter Notebook Conversion
soonjune/nchain_temporally_extended
Language:Python
soonjune/Neural-Linear-Bandits-with-Likelihood-Matching
soonjune/option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
soonjune/os341
operating systems HW
Language:C
soonjune/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
soonjune/q-trader_test_for_samsung
Language:Jupyter Notebook
soonjune/ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
soonjune/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
soonjune/rl_trader
DQN for stock day trading
Language:Python1 0
soonjune/soonjune.github.io
Language:HTML
soonjune/SparceReward
soonjune/spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
soonjune/SWAG_project
for experimenting swag
Language:Python
soonjune/visualization
Language:Python1 0