nemonameless's Stars
OpenBMB/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
myshell-ai/JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
LLaVA-VL/LLaVA-NeXT
yyyujintang/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
multimodal-art-projection/MAP-NEO
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
Picsart-AI-Research/PAIR-Diffusion
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vision-CAIR/MiniGPT4-video
Official code for MiniGPT4-video
RUCAIBox/LLMBox
A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.
SiatMMLab/Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
mira-space/MiraData
lijiannuist/Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
badripatro/simba
Simba
soraw-ai/Awesome-Text-to-Video-Generation
A list for Text-to-Video, Image-to-Video works
zju-pi/diff-sampler
An open-source toolbox for fast sampling of diffusion models. Official implementations for our [CVPR-2024, ICML-2024] papers
Zeqiang-Lai/OpenDMD
Open source implementation and models of One-step Diffusion with Distribution Matching Distillation
Yangzhangcst/Mamba-in-CV
A paper list of some recent Mamba-based CV works.
ReaFly/Awesome-Vision-Mamba
✨✨Latest Papers on Vision Mamba and Related Areas
discus0434/aesthetic-predictor-v2-5
SigLIP-based Aesthetic Score Predictor
shim0114/SSM-Meets-Video-Diffusion-Models
xinghaochen/DECO
Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"