YangLing0818's Stars
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Million-Length Context
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
hymie122/RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
zsyOAOA/ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)
Vchitect/SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Vchitect/LaVie
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
NUS-HPC-AI-Lab/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
lhao499/ringattention
Transformers with Arbitrarily Large Context
penghao-wu/vstar
PyTorch Implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
naver-ai/DenseDiffusion
Official PyTorch Implementation of DenseDiffusion (ICCV 2023)
dvlab-research/LLMGA
This project is the official implementation of "LLMGA: Multimodal Large Language Model based Generation Assistant" (ECCV 2024 Oral)
RUCAIBox/StructGPT
The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"
vvictoryuki/AnimateZero
Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"
orhir/PoseAnything
A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]
hutaiHang/Faster-Diffusion
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
sony/ctm
md-mohaiminul/VideoRecap
KU-CVLAB/DreamMatcher
Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (CVPR 2024)
YangLing0818/RealCompo
[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
YangLing0818/ContextDiff
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
YangLing0818/IPDiff
[ICLR 2024] Protein-Ligand Interaction Prior for Binding-aware 3D Molecule Diffusion Models
PRIV-Creation/Concept-centric-Personalization
The official code of "Concept-centric Personalization with Large-scale Diffusion Priors".
YangLing0818/BindDM
[AAAI 2024] Binding-Adaptive Diffusion Models for Structure-Based Drug Design