shlee94


shlee94's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Language: Python 132k 1.1k 15.7k 26.3k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python 30.2k 429 4.2k 6.4k
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python 9.3k 73 1.1k 1.2k
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python 8.8k 61 1.5k 1.7k
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
Language: Python 1.3k 20 171279
wilson1yan/VideoGPT
Language: Jupyter Notebook 964 23 38117
atulkum/pointer_summarizer
PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
Language: Python 904 16 61243
bryandlee/malnyun_faces
A calm generative model training journal
903 22 466
aviralkumar2907/CQL
Code for conservative Q-learning
Language: Python 394 6 2069
alinlab/CSI
CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances (NeurIPS 2020)
Language: Python 273 7 5462
BerenMillidge/FEP_Active_Inference_Papers
A repository for major/influential FEP and active inference papers.
Language: TeX 178 23 123
schatty/d4pg-pytorch
PyTorch implementation of Distributed Distributional Deterministic Policy Gradients
Language: Python 120 8 626
pokaxpoka/sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Language: Python 119 6 429
alvinchangw/COCON_ICLR2021
PyTorch implementation of CoCon: A Self-Supervised Approach for Controlled Text Generation
Language: Python 94 4 922
younggyoseo/CaDM
CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning
Language: Python 63 7 58
younggyoseo/trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
Language: Python 39 3 15
xkianteb/dril
Disagreement-Regularized Imitation Learning
Language: Python 30 3 012
younggyoseo/lasertag-v0
Implementation of DeepMind's LaserTag-v0 game in A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning (2017)
Language: Python 18 2 02
godtn0/DP-MTL
Language: Python 111
useradd-temp/py-twitch
Python Client for Twitch API
Language: Python 30