SiyeolJung

Siyeol Jung

SiyeolJung's Stars

CompVis/stable-diffusion
A latent text-to-image diffusion model
Language: Jupyter Notebook 67.8k 558 71110.1k
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language: Jupyter Notebook 14.8k 110 3921.3k
XPixelGroup/BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2, DFDNet.
Language: Python 6.7k 91 5561.2k
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python 6.1k 45 80539
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python 4.8k 78 192394
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language: Python 2.7k 73 82425
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
Language: Python 2.5k 30 120198
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language: Python 2.4k 42 107221
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
Language: Python 2.3k 45 70177
chaofengc/IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
Language: Python 1.8k 16 157165
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language: Python 1.3k 62 223150
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
Language: Python 998 26 4778
chaofengc/Awesome-Image-Quality-Assessment
A comprehensive collection of IQA papers
Language: TeX 944 25 964
lucidrains/mixture-of-experts
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Language: Python 617 6 1149
researchmm/MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Language: Python 390 6 2222
wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
Language: Python 346 10 1651
FoundationVision/OmniTokenizer
OmniTokenizer: one model and one weight for image-video joint tokenization.
Language: Python 233 4 195
iffsid/mmvae
Multimodal Mixture-of-Experts VAE
Language: Python 188 7 1141
sony/sqvae
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
Language: Python 177 7 621
AlphacatPlus/VmambaIR
This is the official implementation of "VmambaIR: Visual State Space Model for Image Restoration"
Language: Python 166 18 512
lyndonzheng/CVQ-VAE
[ICCV 2023] Online Clustered Codebook
Language: Python 137 4 19
yangdongchao/LLM-Codec
The open source code for LLM-Codec
Language: Python 108 13 44
thuhcsi/S2G-MDDiffusion
Language: Python 61 1 102
facebookresearch/Qinco
Residual Quantization with Implicit Neural Codebooks
Language: Python 46 1 22
YingqianWang/DistgASR
[TPAMI 2023] DistgASR: Disentangling Mechanism for Light Field Angular Super-Resolution
Language: Python 29 3 48
Boese0601/Dyadic-Interaction-Modeling
[ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation
Language: Python 191
taegyeong-lee/Grid-Diffusion-Models-for-Text-to-Video-Generation
Official Code Repository for the paper "Grid Diffusion Models for Text-to-Video Generation", CVPR 2024
Language: Python 15 3 10
kaistmm/VoxMM
Language: Python 140
taegyeong-lee/Generating-Realistic-Images-from-In-the-wild-Sounds
Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023
Language: Jupyter Notebook 101
sangmin-git/MMSI
Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)
Language: Python 9 2 11
