imabackstabber's Stars
ggerganov/llama.cpp
LLM inference in C/C++
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
openai/shap-e
Generate 3D objects conditioned on text or images
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Johnserf-Seed/TikTokDownload
Batch download of watermark-free Douyin content: a user's homepage posts, likes, favorites, image posts, and audio
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
thu-ml/prolificdreamer
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
bytedance/MVDream-threestudio
3D generation code for MVDream
vsitzmann/phd-master-application-docs
A collection of the application documents I used to apply to universities in the US.
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
neobundy/Deep-Dive-Into-AI-With-MLX-PyTorch
"Deep Dive into AI with MLX and PyTorch" is an educational initiative designed to help anyone interested in AI, specifically in machine learning and deep learning, using Apple's MLX and Meta's PyTorch frameworks.
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
ttxskk/AiOS
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation"
3DTopia/GPTEval3D
[CVPR 2024] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"
wyysf-98/SweetDreamer
PKU-YuanGroup/Cycle3D
[AAAI 2025🔥] Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
Perp-Neg/Perp-Neg-stablediffusion
Source code for Stable Diffusion with Perp-Neg
yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
IDEA-Research/ED-Pose
[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation"
liuff19/DreamReward
[ECCV 2024] DreamReward: Text-to-3D Generation with Human Preference
mlpc-ucsd/TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
lutao2021/BrightDreamer
BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis
AbrahamYabo/Cascade-Zero123
linjing7/ChatHuman
VITA-Group/3D-Mode-Collapse
"Taming Mode Collapse in Score Distillation for Text-to-3D Generation" by Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra
ostadabbas/Seeing-Under-the-Cover
Seeing Under the Cover: A Physics Guided Learning Approach for In-Bed Pose Estimation (MICCAI2019)