fenfenfenfan

start here

HUSTWuhan，China

fenfenfenfan's Stars

Zuellni/ComfyUI-PickScore-Nodes
PickScore nodes for ComfyUI.
Language:Python326
VectorSpaceLab/OmniGen
65012
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language:Python3.7k314
showlab/Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
1521
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python88239
XLabs-AI/x-flux
Language:Python1.4k102
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python7.8k729
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
Language:Python1.6k143
kongds/E5-V
E5-V: Universal Embeddings with Multimodal Large Language Models
Language:Python1536
Xiaojiu-z/Stable-Hair
Stable-Hair: Real-World Hair Transfer via Diffusion Model
34522
fusiming3/MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
812
GAIR-NLP/anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
Language:Python65236
ShaShekhar/aaiela
Language:Python16110
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language:Python1.3k60
lks-ai/anynode
A Node for ComfyUI that does what you ask it to do
Language:Python48532
Nerogar/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Language:Python1.7k137
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
Language:Python23721
google/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Language:Python26.9k1.4k
wangkai930418/DPL
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)
Language:Python864
Xiaojiu-z/SSR_Encoder
Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)
Language:Python875
LC044/WeChatMsg
提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手
Language:Python33.6k3.5k
FreeStyleFreeLunch/FreeStyle
FreeStyle : Free Lunch for Text-guided Style Transfer using Diffusion Models
Language:Python1045
CaraJ7/CoMat
[Neurips 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Language:Python1255
idealo/image-quality-assessment
Convolutional Neural Networks to predict the aesthetic and technical quality of images.
Language:Python2.1k447
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Language:Python87987
cosmicman-cvpr2024/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
Language:Python3107
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language:Python1.4k114
WUyinwei-hah/RRNet
[CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model
Language:Python443
OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Language:Python30014
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python4.3k529

fenfenfenfan

fenfenfenfan's Stars

Zuellni/ComfyUI-PickScore-Nodes

VectorSpaceLab/OmniGen

modelscope/ms-swift

showlab/Awesome-Unified-Multimodal-Models

showlab/Show-o

XLabs-AI/x-flux

THUDM/CogVideo

bghira/SimpleTuner

kongds/E5-V

Xiaojiu-z/Stable-Hair

fusiming3/MARS

GAIR-NLP/anole

ShaShekhar/aaiela

dvlab-research/ControlNeXt

lks-ai/anynode

Nerogar/OneTrainer

SalesforceAIResearch/DiffusionDPO

google/python-fire

wangkai930418/DPL

Xiaojiu-z/SSR_Encoder

LC044/WeChatMsg

FreeStyleFreeLunch/FreeStyle

CaraJ7/CoMat

idealo/image-quality-assessment

christophschuhmann/improved-aesthetic-predictor

cosmicman-cvpr2024/CosmicMan

TencentARC/BrushNet

WUyinwei-hah/RRNet

OSU-NLP-Group/MagicBrush

fudan-generative-vision/champ