chenyingshu's Stars
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Paitesanshi/LLM-Agent-Survey
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
YingqingHe/ScaleCrafter
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
dvlab-research/LLMGA
Official implementation of "LLMGA: Multimodal Large Language Model based Generation Assistant" (ECCV 2024 Oral)
Kobaayyy/Awesome-CVPR2024-ECCV2024-AIGC
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
naver-ai/Visual-Style-Prompting
Official PyTorch implementation of "Visual Style Prompting with Swapping Self-Attention"
elliotwaite/pytorch-to-javascript-with-onnx-js
Run PyTorch models in the browser using ONNX.js
AlonzoLeeeooo/awesome-video-generation
A collection of awesome video generation studies.
yardenfren1996/B-LoRA
Implicit Style-Content Separation using B-LoRA
yuweihao/MM-Vet
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
cure-lab/PnPInversion
[ICLR 2024] Official repo for the paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
Ascend/pytorch
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
PKU-YuanGroup/Cycle3D
Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
nuster1128/LLM_Agent_Memory_Survey
cgtuebingen/SIGNeRF
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
iveevi/ngf
Source code for the SIGGRAPH 2024 conference paper "Neural Geometry Fields for Meshes"
friedrichor/Awesome-Multimodal-Papers
A curated list of awesome Multimodal studies.
ActiveVisionLab/gaussctrl
[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
bkhanal-11/awesome-360-depth-estimation
State-of-the-art papers for depth estimation of 360 images.
qibao77/Detect-the-common-of-images
Implementation of "Matching local self-similarities across images and videos"
zhengziqiang/MarineInst20M
The official dataset repository of "MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description" (ECCV 2024 Oral).
facebookresearch/WaSt3D
Geometry style transfer colorbook