godnpeter

Pinned Repositories

DraftRec
Code for the paper "DraftRec: Personalized Draft Recommendation for Winning in Multi-Player Online Battle Arena Games" (WWW 2022)
Language:Jupyter Notebook17 0 24
2021_davian_deep_learning_study
00
A2C_CURL_MARIO
Implementation of A2C and CURL for Super Mario Bros environment
Language:Python0 1 00
Algorithm
Language:C0 1 00
Computer_Architecture
Language:VHDL00
COSE461_NLP
Korea Univ. COSE461 Natural Language Processing
Language:Jupyter Notebook0 0 00
d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
Language:Jupyter Notebook00
DA-in-visualRL
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
0 0 00
director
Deep Hierarchical Planning from Pixels
Language:Python0 0 00
DMC_Clustering_PICA
Continuous Control Task Interpretation via Clustering
Language:Python10

godnpeter's Repositories

godnpeter/DMC_Clustering_PICA
Continuous Control Task Interpretation via Clustering
Language:Python10
godnpeter/2021_davian_deep_learning_study
00
godnpeter/A2C_CURL_MARIO
Implementation of A2C and CURL for Super Mario Bros environment
Language:Python0 1 00
godnpeter/Algorithm
Language:C0 1 00
godnpeter/Computer_Architecture
Language:VHDL00
godnpeter/COSE461_NLP
Korea Univ. COSE461 Natural Language Processing
Language:Jupyter Notebook0 0 00
godnpeter/d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
Language:Jupyter Notebook00
godnpeter/DA-in-visualRL
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
0 0 00
godnpeter/director
Deep Hierarchical Planning from Pixels
Language:Python0 0 00
godnpeter/dmcontrol-generalization-benchmark
DMControl Generalization Benchmark from Nicklas Hansen
Language:Jupyter Notebook0 0
godnpeter/DQN_MARIO
Language:Python
godnpeter/DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.
Language:Python0 0
godnpeter/DSSRec
Disentangled Self-Supervision in Sequential Recommenders
Language:Python0 0
godnpeter/godnpeter.github.io
Language:HTML
godnpeter/how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
godnpeter/laber
Language:Python0 0
godnpeter/lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
godnpeter/machine-learning-interview
Minimum Viable Study Plan for Machine Learning Interviews from FAAG, Snapchat, LinkedIn.
godnpeter/ManiSkill
Language:Python
godnpeter/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
godnpeter/Operating-Systems
CPU scheduler. Language : C
Language:C1 0
godnpeter/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python0 0
godnpeter/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language:Python0 0
godnpeter/RL4RS
A Real-World Benchmark for Reinforcement Learning based Recommender System
Language:Python0 0
godnpeter/RSPapers
A Curated List of Must-read Papers on Recommender System.
godnpeter/stats701-winter2021
Theory of Reinforcement Learning
0 0
godnpeter/TOV-VICReg
godnpeter/train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
godnpeter/udlbook
Understanding Deep Learning - Simon J.D. Prince