JieLiu95

JieLiu95's Stars

NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫、百度贴吧帖子｜百度贴吧评论回复爬虫 | 知乎问答文章｜评论爬虫
Language:Python16.7k 103 2975.3k
state-spaces/mamba
Mamba SSM architecture
Language:Python12.7k 101 5091.1k
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python10.7k 125 217785
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python10.3k 104 353846
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Language:Python9.5k 78 116681
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
Language:Jupyter Notebook9.4k 103 158746
lllyasviel/Omost
Your image is almost there!
Language:Python7.2k 44 78418
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language:Python6.4k 72 241950
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook5.1k 61 377328
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Language:Python2.9k 30 111184
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language:Python2.7k 46 0171
openai/consistencydecoder
Consistency Distilled Diff VAE
Language:Python2.1k 21 2075
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language:Python2k 31 8485
SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Language:Python1.7k 30 75118
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language:Jupyter Notebook1.7k 26 5193
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Language:Python1.7k 28 87136
ChenyangQiQi/FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
Language:Jupyter Notebook1.1k 14 33106
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Language:Python793 12 8859
Tsingularity/dift
[NeurIPS'23] Emergent Correspondence from Image Diffusion
Language:Python589 7 2532
sibozhang/Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Language:Python419 12 2291
csguoh/MambaIR
[ECCV2024] An official pytorch implement of the paper "MambaIR: A simple baseline for image restoration with state-space model".
Language:Python418 4 5536
Kobaayyy/Awesome-CVPR2024-ECCV2024-AIGC
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
416 7 112
ShihaoZhaoZSH/LaVi-Bridge
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
Language:Python307 16 1620
lelechen63/ATVGnet
CVPR 2019
Language:Python260 16 5254
flyingby/Awesome-Deepfake-Generation-and-Detection
A Survey on Deepfake Generation and Detection
251 13 19
wenhao728/awesome-diffusion-v2v
Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translation. And a video editing benchmark code.
Language:Python116 4 34
YangLing0818/EditWorld
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Language:Python112 7 67
lewandofskee/MambaAD
Official implementation of MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.
Language:Python83 6 80
koushiksrivats/FLIP
Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023)
Language:Python62 4 133
zhourax/VEGA
Language:Python31 1 42