liuziwei7's Stars
3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
LLaVA-VL/LLaVA-NeXT
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
jiawei-ren/dreamgaussian4d
[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting
hitcslj/Awesome-AIGC-3D
A curated list of awesome AIGC 3D papers
ashawkey/InTeX
Interactive Text-to-Texture Synthesis via Unified Depth-aware Inpainting.
jzhang38/LongMamba
Some preliminary explorations of Mamba's context scaling.
3DTopia/ThemeStation
wyf0912/SinSR
[CVPR 2024] SinSR: Diffusion-Based Image Super-Resolution in a Single Step
sherwinbahmani/tc4d
TC4D: Trajectory-Conditioned Text-to-4D Generation
ttxskk/AiOS
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation"
mingyuan-zhang/LMM
Large Motion Model for Unified Multi-Modal Motion Generation
Aleafy/Make_it_Real
Make-it-Real: Unleashing Large Multimodal Model’s Ability for Painting 3D Objects with Realistic Materials
lisiyao21/Duolando
Code for ICLR 2024 paper "Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment"
QY-H00/attention-interpolation-diffusion
Interpolation Between Text-to-Image Generation!
ziqihuangg/Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
ashawkey/objaverse_filter
Naive filter of Objaverse
Jingkang50/PSG4D
4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)
king159/svd-mv
Unofficial Implementation of "Stable Video Diffusion Multi-View"
TaoHuUMD/SurMo
TaoHuUMD/StructLDM
AtsuMiyai/UPD
[arXiv 2024] Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
ldkong1205/Calib3D
Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding
wqyin/WHAC
Official Code for "WHAC: World-grounded Humans and Cameras"
shulin16/MMInA
Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"
arthur-qiu/FreeNoise-LaVie
[ICLR 2024] Code for FreeNoise based on LaVie
Vchitect/Optix
Memory Efficient Training Framework for Large Video Generation Model
youquanl/M3Net
Multi-Space Alignments Towards Universal LiDAR Segmentation