zipengxuc
Ph.D. student at MHUG, University of Trento. Research Interests: Generative Models, Vision-Language.
University of TrentoItaly
zipengxuc's Stars
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
lllyasviel/ControlNet
Let us control diffusion models!
mli/paper-reading
深度学习经典、新论文逐段精读
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
ytongbai/LVM
uncbiag/Awesome-Foundation-Models
A curated list of foundation models for vision and language tasks
jianzongwu/Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
fnzhan/Generative-AI
[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
SkalskiP/top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
salesforce/UniControl
Unified Controllable Visual Generation Model
SeedV/generative-ai-roadmap
The roadmap of generative AI: use cases and applications | 生成式AI的应用路线图
dome272/Wuerstchen
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
Audio-AGI/WavJourney
WavJourney: Compositional Audio Creation with LLMs
codingonion/awesome-llm-and-aigc
🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.
CroitoruAlin/Diffusion-Models-in-Vision-A-Survey
This repository categorizes the papers about diffusion models applied in computer vision according to their target task. The classifcation is based on our survey: https://arxiv.org/abs/2209.04747v1
OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
inbarhub/DDPM_inversion
Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
mwxely/AIGS
AI-Generated Images as Data Source: The Dawn of Synthetic Era
EGCap/awesome-gpt4-vision
A collection of awesome GPT4 vision use cases
altndrr/vic
Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
j-min/VPGen
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)