Yuliang-Zou
Research Scientist at Waymo. Former Ph.D. at Virginia Tech (@vt-vl-lab). Ex-intern at Adobe, NEC Labs, Google, and Waymo.
WaymoMountain View
Yuliang-Zou's Stars
s0md3v/roop
one-click face swap
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
instaloader/instaloader
Download pictures (or videos) along with their captions and other metadata from Instagram.
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
threestudio-project/threestudio
A unified framework for 3D content generation.
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
openai/consistencydecoder
Consistency Distilled Diff VAE
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
3DTopia/OpenLRM
An open-source impl. of Large Reconstruction Models
megvii-research/HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
OPEN-AIR-SUN/mars
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving
hitcslj/Awesome-AIGC-3D
A curated list of awesome AIGC 3D papers
liuyuan-pal/NeRO
[SIGGRAPH2023] NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images
LinkSoul-AI/Chinese-LLaVA
支持中英文双语视觉-文本对话的开源可商用多模态模型。
tobiasfshr/map4d
Photo-realistic mapping of dynamic urban areas
chaoswork/llm_illustrated
看图学大模型
PJLab-ADG/OASim
OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving
daveredrum/SceneTex
[CVPR 2024 Highlight] SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
kaixindelele/ChatOpenReview
Crowdfunding open source projects: use OpenReview's high-quality review data to fine-tune a professional review and response LLM. 众筹开源项目:利用OpenReview的优质审稿数据,微调出一个专业的审稿和审稿回复GPT
yklcs/jaxsplat
3D Gaussian Splatting in JAX
LemonATsu/NPC-pytorch
Pytorch Implementation for Neural Point Characters (NPC)
LemonATsu/CUDA-kNN-Aniso-Gaussian-Feature-Aggregation