HaozheLiu-ST

PhD Student, AI Initiative KAUST

AI Initiative, KAUST; prev. TencentSA

HaozheLiu-ST's Stars

meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python 27.9k 234 2753.2k
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python 19.4k 166 01.4k
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Jupyter Notebook 6.7k 120 109446
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Language: Python 1.7k 40 12985
NX-AI/xlstm
Official repository of the xLSTM.
Language: Python 1.6k 18 57119
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Language: Python 1.5k 43 58153
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python 1.5k 21 7158
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Language: Python 1.2k 18 7366
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language: Python 1.1k 15 4748
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Language: Python 1.1k 41 4859
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Language: Python 830 15 5440
metauto-ai/GPTSwarm
🐝 GPTSwarm: LLM agents as (Optimizable) Graphs
Language: Python 728 9 1348
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Language: Python 682 11 8534
kongzhecn/OMG
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
Language: Python 674 13 2445
HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
Language: Python 373 11 1824
universome/stylegan-v
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
Language: Python 362 19 4636
JunyaoHu/common_metrics_on_video_quality
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
Language: Python 272 1 1811
ximinng/SVGDreamer
[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476
Language: Python 253 8 2523
ximinng/DiffSketcher
[NIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685
Language: Python 250 8 1428
Karine-Huang/T2I-CompBench
[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
Language: Python 225 3 277
sming256/OpenTAD
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
Language: Python 206 5 4014
czg1225/AsyncDiff
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
Language: Python 173 4 1012
ximinng/VectorFusion-pytorch
[CVPR 2023] Unofficial implementation for "VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models"
Language: Python 128 2 47
JettHu/ComfyUI_TGate
T-GATE implementation for ComfyUI.
Language: Python 90 3 159
SAIS-FUXI/VidGen
Language: Python 58 2 64
azminewasi/Awesome-LLMs-ICLR-24
It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.
46 1 01
BenchCouncil/AIGCBench
Official repo for AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI
Language: Python 37 0 31
showlab/Long-form-Video-Prior
Language: Python 24 3 2
showlab/Tune-An-Ellipse
[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want
9 2 11
piotrpiekos/adaptive-printer
💥 Utilizing a Malfunctioning 3D Printer by Modeling Its Dynamics with Artificial Intelligence (ICRA 2024)
Language: Python 10
