heatz123

Computer Science and Engineering @ SNU

heatz123's Stars

heatz123/tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Language: Python 131
kvfrans/rlbase_stable
Language: Python 361
heatz123/heatz123.github.io
My personal website
Language: HTML 1
quasimetric-learning/quasimetric-rl
Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023
Language: Python 395
seohongpark/HILP
Foundation Policies with Hilbert Representations (ICML 2024)
Language: Python 593
seohongpark/METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
Language: Python 464
jonbarron/website
Language: HTML 2.4k 1.9k
eloialonso/iris
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
Language: Python 78175
SyphonArch/swpp202301-compiler-team1
Language: C++ 1
holenet/Pentris
Tetris variation game using blocks of 5 triangles
Language: Java 2
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python 8.8k 1.1k
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language: Python 2.1k 190
hijkzzz/alpha-zero-gomoku
A Multi-threaded Implementation of AlphaZero
Language: Python 35848
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python 6.5k 1.2k
keonlee9420/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Language: Python 18644
Maghoumi/pytorch-softdtw-cuda
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch
Language: Python 60656
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Language: Python 45267
Jmkernes/Diffusion
Everything related to diffusion models!
Language: Jupyter Notebook 336
CGDTheGenius/Top3MainMatchFront
Language: Svelte 2
CGDTheGenius/Top3DeathMatchFront
Language: Svelte 2
CGDTheGenius/Top3MainMatchBack
Language: Python 2
CGDTheGenius/Top3DeathMatchBack
Language: Python 2
CGDTheGenius/Rules
CGD The Genius rules
3
tts-tutorial/interspeech2022
1605
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python 29.8k 6.3k
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python 129k 25.7k
holenet/Reinforcement-Learning-Gomoku-Web-Client
Language: Svelte 2
reinforcement-learning-kr/alpha_omok
Minimal version of DeepMind AlphaZero
Language: Python 8020
heatz123/Reinforcement-Learning-Gomoku
Language: Python 5
junxiaosong/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Language: Python 3.2k 963
