huzimun's Stars
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
CodeGoat24/Face-diffuser
[CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.
xavihart/PDM-Pure
PDM-based Purifier
psyker-team/mist-v2
A watermarking tool to protect artworks from AIGC-driven style mimicry (e.g. LoRA)
caradryanl/ACE
chaofengc/IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
xavihart/Diff-Protect
🛡️[ICLR'2024] Toward effective protection against diffusion-based mimicry through score distillation, a.k.a SDS-Attack
SiatMMLab/Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
zhangxulu1996/awesome-personalization
datar001/web-face-editor
An interactive web editing face attributes. Users can upload a face picture, and select a attribute~(hair, eye, glasses, hair color, etc), web can randomly change this attribute.
baaivision/Emu3
Next-Token Prediction is All You Need
lewandofskee/DiAD
Official implementation of DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection.
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
LlamaFamily/Llama-Chinese
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
songquanpeng/one-api
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
seanzhang-zhichen/llama3-chinese
Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
datar001/Attack-Pattern-on-T2I
Adversarial Attacks on Malicious Image Generation and its Underlying Pattern Discovery
datar001/Revealing-Vulnerabilities-in-Stable-Diffusion-via-Targeted-Attacks
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
youngyangyang04/leetcode-master
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
lifangting/CVPR2024-Diffusion-Model
CVPR2024-Diffusion-Model
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
overleaf/toolkit
huzimun/FaceOff
Preventing Unauthorized Text-to-Image Identity Customization
datar001/Awesome-AD-on-T2IDM
A collection of resources on attacks and defenses targeting text-to-image diffusion models