xyxxmb

xyxxmb's Stars

naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.2k 92 161.1k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.1k 100 530847
lllyasviel/Omost
Your image is almost there!
Language:Python7.2k 44 78418
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook6.9k 74 204440
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Language:Python4.7k 30 493382
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Python3.3k 40 168285
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python2.5k 38 152299
XLabs-AI/x-flux
Language:Python1.4k 27 10098
XLabs-AI/x-flux-comfyui
Language:Python868 10 9860
megvii-research/HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
Language:Jupyter Notebook747 6 3142
cubiq/PuLID_ComfyUI
PuLID native implementation for ComfyUI
Language:Python578 8 6836
csyxwei/ELITE
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
Language:Python510 43 2030
AIGText/Glyph-ByT5
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""
Language:Jupyter Notebook489 17 1721
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
Language:Python465 17 733
hehao13/CameraCtrl
Language:Python411 12 1517
wyysf-98/CraftsMan
CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
Language:Python392 14 2617
MC-E/ReVideo
Language:Python304 24 58
sail-sg/CLoT
CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".
Language:Python296 8 2012
haoosz/ViCo
Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"
Language:Jupyter Notebook236 19 1515
open-mmlab/StyleShot
StyleShot: A SnapShot on Any Style. 一款可以迁移任意风格到任意内容的模型，无需针对图片微调，即能生成高质量的个性风格化图片!
Language:Python224 2 1814
zibojia/COCOCO
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
Language:Python218 5 83
bytedance/MoMA
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Language:Jupyter Notebook181 3 1016
Ling-APE/ComfyUI-All-in-One-FluxDev-Workflow
An All-in-One FluxDev workflow in ComfyUI that combines various techniques for generating images with the FluxDev model, including img-to-img and text-to-img. This workflow can use LoRAs, ControlNets, enabling negative prompting with Ksampler, dynamic thresholding, inpainting, and more.
149 6 104
discus0434/aesthetic-predictor-v2-5
SigLIP-based Aesthetic Score Predictor
Language:Python127 1 71
stylus-diffusion/stylus
Language:Jupyter Notebook118 9 16
PairCustomization/PairCustomization
Language:Python86 5 75
Mowenyii/PAE
[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation
Language:Python55 3 18
CodeGoat24/Face-diffuser
[CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.
Language:Python37 2 01
wfanyue/DPG-T2I-Personalization
[ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
Language:Python32 4 31
PrototypeNx/DETEX
Decoupled Textual Embeddings for Customized Image Generation (AAAI 2024)
Language:Python16