Pinned Repositories
DraftRec
Code for the paper "DraftRec: Personalized Draft Recommendation for Winning in Multi-Player Online Battle Arena Games" (WWW 2022)
2021_davian_deep_learning_study
A2C_CURL_MARIO
Implementation of A2C and CURL for Super Mario Bros environment
Algorithm
Computer_Architecture
COSE461_NLP
Korea Univ. COSE461 Natural Language Processing
d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
DA-in-visualRL
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
director
Deep Hierarchical Planning from Pixels
DMC_Clustering_PICA
Continuous Control Task Interpretation via Clustering
godnpeter's Repositories
godnpeter/DMC_Clustering_PICA
Continuous Control Task Interpretation via Clustering
godnpeter/2021_davian_deep_learning_study
godnpeter/A2C_CURL_MARIO
Implementation of A2C and CURL for Super Mario Bros environment
godnpeter/Algorithm
godnpeter/Computer_Architecture
godnpeter/COSE461_NLP
Korea Univ. COSE461 Natural Language Processing
godnpeter/d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
godnpeter/DA-in-visualRL
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
godnpeter/director
Deep Hierarchical Planning from Pixels
godnpeter/dmcontrol-generalization-benchmark
DMControl Generalization Benchmark from Nicklas Hansen
godnpeter/DQN_MARIO
godnpeter/DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.
godnpeter/DSSRec
Disentangled Self-Supervision in Sequential Recommenders
godnpeter/godnpeter.github.io
godnpeter/how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
godnpeter/laber
godnpeter/lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
godnpeter/machine-learning-interview
Minimum Viable Study Plan for Machine Learning Interviews from FAAG, Snapchat, LinkedIn.
godnpeter/ManiSkill
godnpeter/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
godnpeter/Operating-Systems
CPU scheduler. Language : C
godnpeter/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
godnpeter/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
godnpeter/RL4RS
A Real-World Benchmark for Reinforcement Learning based Recommender System
godnpeter/RSPapers
A Curated List of Must-read Papers on Recommender System.
godnpeter/stats701-winter2021
Theory of Reinforcement Learning
godnpeter/TOV-VICReg
godnpeter/train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
godnpeter/udlbook
Understanding Deep Learning - Simon J.D. Prince