Pinned Repositories
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
zigma
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
DiffSynth-Studio
Enjoy the magic of Diffusion models!
ScaleCrafter-ptl
1111112's Repositories
1111112/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image