heatz123's Stars
heatz123/tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
kvfrans/rlbase_stable
heatz123/heatz123.github.io
My personal website
quasimetric-learning/quasimetric-rl
Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023
seohongpark/HILP
Foundation Policies with Hilbert Representations (ICML 2024)
seohongpark/METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
jonbarron/website
eloialonso/iris
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
SyphonArch/swpp202301-compiler-team1
holenet/Pentris
Tetris variation game using blocks of 5 triangles
huggingface/trl
Train transformer language models with reinforcement learning.
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
hijkzzz/alpha-zero-gomoku
A Multi-threaded Implementation of AlphaZero
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
keonlee9420/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Maghoumi/pytorch-softdtw-cuda
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Jmkernes/Diffusion
Everything related to diffusion models!
CGDTheGenius/Top3MainMatchFront
CGDTheGenius/Top3DeathMatchFront
CGDTheGenius/Top3MainMatchBack
CGDTheGenius/Top3DeathMatchBack
CGDTheGenius/Rules
체계단 더지니어스 룰
tts-tutorial/interspeech2022
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
holenet/Reinforcement-Learning-Gomoku-Web-Client
reinforcement-learning-kr/alpha_omok
Minimal version of DeepMind AlphaZero
heatz123/Reinforcement-Learning-Gomoku
junxiaosong/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)