JieLiu95's Stars
NanmiCoder/MediaCrawler
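Xiaohongshu note | comment crawler, Douyin video | comment crawler, Kuaishou video | comment crawler, Bilibili video | comment crawler, Weibo post | comment crawler, Baidu Tieba post | Baidu Tieba comment-reply crawler | Zhihu Q&A article | comment crawler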
state-spaces/mamba
Mamba SSM architecture
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
guoyww/AnimateDiff
Official implementation of AnimateDiff.
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
lllyasviel/Omost
Your image is almost there!
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
openai/consistencydecoder
Consistency Distilled Diff VAE
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
ChenyangQiQi/FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Tsingularity/dift
[NeurIPS'23] Emergent Correspondence from Image Diffusion
sibozhang/Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
csguoh/MambaIR
[ECCV2024] An official PyTorch implementation of the paper "MambaIR: A simple baseline for image restoration with state-space model".
Kobaayyy/Awesome-CVPR2024-ECCV2024-AIGC
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
ShihaoZhaoZSH/LaVi-Bridge
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
lelechen63/ATVGnet
CVPR 2019
flyingby/Awesome-Deepfake-Generation-and-Detection
A Survey on Deepfake Generation and Detection
wenhao728/awesome-diffusion-v2v
Awesome diffusion Video-to-Video (V2V). A collection of papers on diffusion model-based video editing, a.k.a. video-to-video (V2V) translation, plus video editing benchmark code.
YangLing0818/EditWorld
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
lewandofskee/MambaAD
Official implementation of MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.
koushiksrivats/FLIP
Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023)
zhourax/VEGA