ZyoungXu's Stars
996icu/996.ICU
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Vchitect/Latte
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
latentcat/latentbox
A collection of awesome-lists for AI, creativity and art. AI、创意和艺术领域的精选合集。https://latentbox.com
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
DataXujing/YOLOv8
:fire: Official YOLOv8模型训练和部署
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
AntonioTepsich/Convolutional-KANs
This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learnable non linear activations in each pixel.
chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
tianweiy/DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
catcathh/UltraPixel
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
yuvalkirstain/PickScore
ulab-uiuc/AGI-survey
voletiv/mcvd-pytorch
Official implementation of MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation (https://arxiv.org/abs/2205.09853)
Ji4chenLi/t2v-turbo
Code repository for T2V-Turbo and T2V-Turbo-v2
hustvl/DiG
[CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
dengxl0520/MemSAM
[CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.
QiaoLiuHit/LSOTB-TIR
LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark (ACM MM2020)
hustvl/ViG
[AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention
berlino/gated_linear_attention
clintonjwang/ControlNet
Generate videos that interpolate between two given images
gjhhust/YOLOFT
A code base for the official XS-VID dataset baseline method YOLOFT
gjhhust/XS-VID
XS-VID: An Extra Small Object Video Detection Dataset
ZyoungXu/GenDSA
🔥V2 coming soon! Under reviewing~🔥[Med - Cell Press] Large-scale Pretrained Frame Generative Model Enables Real-Time Low-Dose DSA Imaging: an AI System Development and Multicenter Validation Study
ZyoungXu/MoSt-DSA
[ECAI 2024] MoSt-DSA: Modeling Motion and Structural Interactions for Direct Multi-Frame Interpolation in DSA Images