yilei0620's Stars
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
black-forest-labs/flux
Official inference repo for FLUX.1 models
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
deep-floyd/IF
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
novicezk/midjourney-proxy
代理 MidJourney 的discord频道,实现api形式调用AI绘图
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
dome272/Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
allenai/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
ziqi-jin/finetune-anything
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
OpenTexture/Paint3D
[CVPR 2024] Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model
IDEA-Research/OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
ShihaoZhaoZSH/Uni-ControlNet
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
jkulhanek/wild-gaussians
[NeurIPS'24] WildGaussians: 3D Gaussian Splatting In the Wild
RockeyCoss/Prompt-Segment-Anything
This is an implementation of zero-shot instance segmentation using Segment Anything.
passivebot/midjourney-automation-bot
This repository hosts the Midjourney Automation Bot, a free script leveraging OpenAI's GPT-3 for automated image generation via Discord. It offers a simple web interface, customizable settings, and is MIT licensed for ease of use and adaptation.
junleen/RainNet
[CVPR 2021] Region-aware Adaptive Instance Normalization for Image Harmonization
zeng-yifei/STAG4D
Official Implementation for STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
ifsheldon/stannum
Fusing Taichi into PyTorch
HighCWu/control-lora-v2
ControlLoRA Version 2: A Lightweight Neural Network To Control Stable Diffusion Spatial Information Version 2
TideDra/VL-RLHF
A RLHF Infrastructure for Vision-Language Models
RodinHD/RodinHD
[ECCV 2024] RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
bao-io/midjourney-sdk
MidJourney in Discord API.