shlee94's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
huggingface/trl
Train transformer language models with reinforcement learning.
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
wilson1yan/VideoGPT
atulkum/pointer_summarizer
pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
bryandlee/malnyun_faces
침착한 생성모델 학습기
aviralkumar2907/CQL
Code for conservative Q-learning
alinlab/CSI
CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances (NeurIPS 2020)
BerenMillidge/FEP_Active_Inference_Papers
A repository for major/influential FEP and active inference papers.
schatty/d4pg-pytorch
PyTorch implementation of Distributed Distributional Deterministic Policy Gradients
pokaxpoka/sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
alvinchangw/COCON_ICLR2021
Pytorch implementation of CoCon: A Self-Supervised Approach for Controlled Text Generation
younggyoseo/CaDM
CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning
younggyoseo/trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
xkianteb/dril
Disagreement-Regularized Imitation Learning
younggyoseo/lasertag-v0
Implementation of Deepmind's LaserTag-v0 game in A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning(2017)
godtn0/DP-MTL
useradd-temp/py-twitch
Python Client for Twitch API