zhangquanwei962

zhangquanwei962's Stars

magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language:Python10.4k1.1k
apple/ml-mgie
Language:Python3.8k253
haofanwang/cropimage
A simple toolkit for detecting and cropping main body from pictures. Support face and saliency detection.
Language:Python415
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Language:Python1.2k149
yzhang2016/video-generation-survey
A reading list of video generation
37829
open-mmlab/PowerPaint
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型，可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成，只需要一个模型
Language:Python58538
vislearn/ControlNet-XS
Language:Python44012
cientgu/InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
Language:Python37820
webtoon/dreamstyler
Official implementation of "DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models" (AAAI24)
Language:Python373
FreeStyleFreeLunch/FreeStyle
FreeStyle : Free Lunch for Text-guided Style Transfer using Diffusion Models
Language:Python1046
google/style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
Language:Python1.2k90
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python13.6k1.1k
TencentARC/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
Language:Python71926
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Language:Python1.8k156
yeungchenwa/FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
Language:Python27623
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language:C++1.4k165
genforce/freecontrol
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
Language:Python42512
nmhkahn/dreamstyler
Language:JavaScript7
sled-group/InfEdit
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
Language:Python2688
Jamie-Cheung/ArtBank
ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank (AAAI2024)
Language:Jupyter Notebook322
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language:C++61.4k9.4k
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Jupyter Notebook7.6k471
OPPO-Mente-Lab/Subject-Diffusion
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Language:Python27711
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook5.1k331
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Language:Python4.5k333
Sanster/xy-cut
Language:Python7615
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
Language:Python5.9k407
Yuxinn-J/Scenimefy
[ICCV 2023] Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
Language:Python26317
langmanbusi/Semantic-Aware-Low-Light-Image-Enhancement
Semantic-Aware LLIE. CVPR 2023
Language:Python1947
AlenUbuntu/StyleTransfer
an PyTorch image deep style transfer library. It provies implementations of current SOTA algorithms, including AdaIN, WCT, LinearStyleTransfer, and FastPhotoTransfer
Language:Python132