Yuliang-Zou

Research Scientist at Waymo. Former Ph.D. at Virginia Tech (@vt-vl-lab). Ex-intern at Adobe, NEC Labs, Google, and Waymo.

WaymoMountain View

Yuliang-Zou's Stars

s0md3v/roop
one-click face swap
Language:Python28.9k 265 07.1k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language:Python27k 211 4.4k5.5k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python21k 158 1.6k2.3k
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
Language:HTML12.4k 104 241.3k
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9.3k 85 38877
instaloader/instaloader
Download pictures (or videos) along with their captions and other metadata from Instagram.
Language:Python9.1k 163 2.2k1.2k
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Language:HTML6.5k 267 59393
threestudio-project/threestudio
A unified framework for 3D content generation.
Language:Jupyter Notebook6.4k 78 335488
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Language:Python5.3k 39 41516
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
4k 33 93195
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python3.1k 29 201219
openai/consistencydecoder
Consistency Distilled Diff VAE
Language:Python2.1k 21 2076
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Language:Python1.7k 25 90204
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
Language:Python1.6k 77 44138
invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
Language:Python1.6k 22 68115
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Language:Python1k 10 3374
3DTopia/OpenLRM
An open-source impl. of Large Reconstruction Models
Language:Python1k 28 6159
megvii-research/HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
Language:Jupyter Notebook789 9 3244
OPEN-AIR-SUN/mars
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving
Language:Python689 11 15164
hitcslj/Awesome-AIGC-3D
A curated list of awesome AIGC 3D papers
604 24 021
liuyuan-pal/NeRO
[SIGGRAPH2023] NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images
Language:Python557 10 3737
LinkSoul-AI/Chinese-LLaVA
支持中英文双语视觉-文本对话的开源可商用多模态模型。
Language:Python358 5 932
tobiasfshr/map4d
Photo-realistic mapping of dynamic urban areas
Language:Python238 29 169
chaoswork/llm_illustrated
看图学大模型
Language:Python232 7 014
PJLab-ADG/OASim
OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving
Language:Python217 12 1118
daveredrum/SceneTex
[CVPR 2024 Highlight] SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
Language:Python200 6 139
kaixindelele/ChatOpenReview
Crowdfunding open source projects: use OpenReview's high-quality review data to fine-tune a professional review and response LLM. 众筹开源项目：利用OpenReview的优质审稿数据，微调出一个专业的审稿和审稿回复GPT
Language:Python197 9 012
yklcs/jaxsplat
3D Gaussian Splatting in JAX
Language:Cuda55 4 24
LemonATsu/NPC-pytorch
Pytorch Implementation for Neural Point Characters (NPC)
Language:Python26 5 13
LemonATsu/CUDA-kNN-Aniso-Gaussian-Feature-Aggregation
Language:Cuda8 3 00

Yuliang-Zou

Yuliang-Zou's Stars

s0md3v/roop

huggingface/diffusers

haotian-liu/LLaVA

liguodongiot/llm-action

karpathy/minbpe

instaloader/instaloader

MrNeRF/awesome-3D-gaussian-splatting

threestudio-project/threestudio

google/gemma_pytorch

deepseek-ai/DeepSeek-V2

PKU-YuanGroup/Video-LLaVA

openai/consistencydecoder

microsoft/LLaVA-Med

omerbt/TokenFlow

invictus717/MetaTransformer

lukasHoel/text2room

3DTopia/OpenLRM

megvii-research/HiDiffusion

OPEN-AIR-SUN/mars

hitcslj/Awesome-AIGC-3D

liuyuan-pal/NeRO

LinkSoul-AI/Chinese-LLaVA

tobiasfshr/map4d

chaoswork/llm_illustrated

PJLab-ADG/OASim

daveredrum/SceneTex

kaixindelele/ChatOpenReview

yklcs/jaxsplat

LemonATsu/NPC-pytorch

LemonATsu/CUDA-kNN-Aniso-Gaussian-Feature-Aggregation