17Skye17's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
black-forest-labs/flux
Official inference repo for FLUX.1 models
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
milesial/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
cocodataset/cocoapi
COCO API - Dataset @ http://cocodataset.org/
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
openai/consistencydecoder
Consistency Distilled Diff VAE
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
buaacyw/GaussianEditor
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
yerfor/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
JusticeFighterDance/JusticeFighter110
田柯宇 (Tian Keyu)恶意攻击集群事件的证据揭露
ShenhanQian/GaussianAvatars
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
limuloo/MIGC
[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)
mjkwon2021/CAT-Net
Official code for CAT-Net: Compression Artifact Tracing Network. Image manipulation detection and localization.
yukangcao/GS-VTON
[arXiv 2024] GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
zhang-zx/AVID
This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.
dasGringuen/assetto_corsa_gym
Assetto Corsa OpenAI Gym Environment
IDT-ITI/MMFusion-IML
Code and trained models for our paper: K. Triaridis, V. Mezaris, "Exploring Multi-Modal Fusion for Image Manipulation Detection and Localization", Proc. 30th Int. Conf. on MultiMedia Modeling (MMM 2024), Amsterdam, NL, Jan.-Feb. 2024.
lyx0208/3dSwap
Code and project page for "3D-aware Face Swapping" in CVPR 2023
zhenglinpan/AnitaDataset
A free, licensed, and industrial animation dataset
owenzlz/PAL4VST
Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV 23')