LuckWan

LuckWan's Stars

open-mmlab/PowerPaint
[ECCV 2024] PowerPaint, a high-quality, versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model.
Language:Python74447
Sanster/IOPaint
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Language: Python, 20.2k stars, 2k forks
datawhalechina/self-llm
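"A Guide to Using Open-Source Large Models": a tutorial tailored for ** beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment.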
Language: Jupyter Notebook, 11.3k stars, 1.3k forks
LLaVA-VL/LLaVA-NeXT
Language: Python, 3.3k stars, 288 forks
langgptai/LangGPT
LangGPT: Empowering everyone to become a prompt expert! 🚀 Structured Prompt, Language of GPT
Language: Jupyter Notebook, 7.6k stars, 618 forks
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Language: Python, 6.8k stars, 523 forks
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language: Python, 1.9k stars, 112 forks
swordlidev/Evaluation-Multimodal-LLMs-Survey
A Survey on Benchmarks of Multimodal Large Language Models
786
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
13.5k stars, 857 forks
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language: Python, 37.9k stars, 4.7k forks
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Jupyter Notebook, 6.7k stars, 447 forks
KovenYu/WonderJourney
Language:Python72542
KovenYu/WonderWorld
Code release for https://kovenyu.com/WonderWorld/
Language:Python40116
aim-uofa/Framer
Official PyTorch implementation of "Framer: Interactive Frame Interpolation".
Language:Python39818
TrickyGo/SinMPI
PyTorch implementation of SinMPI (SIGGRAPH Asia 2023)
Language:Python534
thuanz123/realfill
Unofficial implementation of RealFill
Language:Jupyter Notebook36427
yxuhan/AdaMPI
[SIGGRAPH 2022] Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images
Language:Python22125
HengyiWang/spann3r
[3DV'25] 3D Reconstruction with Spatial Memory
Language:Python85641
naver/mast3r
Grounding Image Matching in 3D with MASt3R
Language: Python, 1.6k stars, 122 forks
zju3dv/EfficientLoFTR
Code for "Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed", CVPR 2024
Language:Jupyter Notebook67549
andyzeng/tsdf-fusion-python
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Language: Python, 1.3k stars, 219 forks
apple/ml-neuman
Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)
Language: Python, 1.3k stars, 145 forks
cswry/OSEDiff
[NeurIPS 2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution
Language:Python29221
apple/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Language: Python, 4k stars, 281 forks
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Language: Python, 1.1k stars, 40 forks
16lemoing/dot
Dense Optical Tracking: Connecting the Dots
Language:Python26417
Picsart-AI-Research/MI-GAN
[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices
Language:Python51046
autonomousvision/unimatch
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
Language: Python, 1.2k stars, 119 forks
ZhengPeng7/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Language: Jupyter Notebook, 1.6k stars, 117 forks
Pokerlishao/LoopGaussian
Language:Python381