marmotxcw's Stars
Xiang-cd/DiffEdit-stable-diffusion
An unofficial implement of DiffEdit on stable-diffusion
ruilin19/DiffEdit-by-Stable-Diffusion
An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance”
runwayml/stable-diffusion
Latent Text-to-Image Diffusion
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Harry24k/adversarial-attacks-pytorch
PyTorch implementation of adversarial attacks [torchattacks].
Imageomics/bioclip
This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
sony/ctm
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
yibo-miao/PBO-Attack
Prior-guided Bayesian Optimization (P-BO) Black-box Adversarial Attack
leeguandong/Awesome-Chinese-Stable-Diffusion
中文文生图stable diffsion模型集合
Jasonlee1995/ImageNet-1K
ImageNet-1K data download, processing for using as a dataset
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
LLaVA-VL/LLaVA-NeXT
JailbreakBench/jailbreakbench
An Open Robustness Benchmark for Jailbreaking Language Models [arXiv 2024]
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
KindXiaoming/pykan
Kolmogorov Arnold Networks
ericyinyzy/VQAttack
This is an official repository of ``VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models'' (AAAI 2024). Codes are coming soon!)
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
hiyouga/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
AoiDragon/HADES
[ECCV'24] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models''
meta-llama/llama3
The official Meta Llama 3 GitHub site
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
CompVis/stable-diffusion
A latent text-to-image diffusion model
kunzhan/InfoMatch
IJCAI 2024, InfoMatch: Entropy neural estimation for semi-supervised image classification
isXinLiu/Awesome-MLLM-Safety
Accepted by IJCAI-24 Survey Track