LuckWan's Stars
open-mmlab/PowerPaint
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型,可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成,只需要一个模型
Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
datawhalechina/self-llm
《开源大模型食用指南》针对**宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
LLaVA-VL/LLaVA-NeXT
langgptai/LangGPT
LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
swordlidev/Evaluation-Multimodal-LLMs-Survey
A Survey on Benchmarks of Multimodal Large Language Models
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
KovenYu/WonderJourney
KovenYu/WonderWorld
Code release for https://kovenyu.com/WonderWorld/
aim-uofa/Framer
Official PyTorch implementation of "Framer: Interactive Frame Interpolation".
TrickyGo/SinMPI
Pytorch implementation of SinMPI (SIGGRAPH Asia 2023)
thuanz123/realfill
Unofficial implementation of RealFill
yxuhan/AdaMPI
[SIGGRAPH 2022] Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images
HengyiWang/spann3r
[3DV'25] 3D Reconstruction with Spatial Memory
naver/mast3r
Grounding Image Matching in 3D with MASt3R
zju3dv/EfficientLoFTR
Code for "Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed", CVPR 2024
andyzeng/tsdf-fusion-python
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
apple/ml-neuman
Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)
cswry/OSEDiff
[NeurlPS2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution
apple/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
16lemoing/dot
Dense Optical Tracking: Connecting the Dots
Picsart-AI-Research/MI-GAN
[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices
autonomousvision/unimatch
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
ZhengPeng7/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Pokerlishao/LoopGaussian