RichFrain's Stars
DDGRCF/YOLOX_OBB
https://zhuanlan.zhihu.com/p/430850089
Akegarasu/lora-scripts
LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
YaoleiQi/DSCNet
Pytorch Implement of Dynamic Snake Convolution (ICCV2023)
SJTU-Thinklab-Det/DOTA-DOAI
This repo is the codebase for our team to participate in DOTA related competitions, including rotation and horizontal detection.
ZGCTroy/LayoutDiffusion
Picsart-AI-Research/IPL-Zero-Shot-Generative-Model-Adaptation
[CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning
CASIA-IVA-Lab/AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
bubbliiiing/yolox-pytorch
这是一个yolox-pytorch的源码,可以用于训练自己的模型。
TencentARC/T2I-Adapter
T2I-Adapter
WH-HuanWang/Defect-GLM
Defect-GLM:A Large Visual-Language Model for Industrial Defect Monitoring|首个用于工业缺陷监测的开源大规模视觉语言模型
yuhongtian17/Spatial-Transform-Decoupling
msracver/Deformable-ConvNets
Deformable Convolutional Networks
mshenoda/diffugen
Generating Labeled Image Datasets using Stable Diffusion Models
lilijiangg/AutoDiffusion
jmhessel/clipscore
CLIPScore EMNLP code
luping-liu/PNDM
The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM, PLMS | ICLR2022)
JDAI-CV/CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
mlfoundations/patching
Patching open-vocabulary models by interpolating weights
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
bfshi/AbSViT
Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)
LeapLabTHU/FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
lyhue1991/torchkeras
Pytorch❤️ Keras 😋😋
rinongal/textual_inversion
ali-vilab/Cones-V2
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
noagarcia/phase
PHASE annotations for societal bias in vision-and-language tasks.
baofff/U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
Dao-AILab/flash-attention
Fast and memory-efficient exact attention