ZhichaoZuo

ZhichaoZuo's Stars

lllyasviel/ControlNet
Let us control diffusion models!
Language: Python | Stars: 31.1k | Forks: 2.8k
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Language: Python | Stars: 2.5k | Forks: 200
harvardnlp/annotated-transformer
An annotated implementation of the Transformer paper.
Language: Jupyter Notebook | Stars: 5.8k | Forks: 1.3k
chunhuizhang/deep_learning_notes
42
BrianPulfer/PapersReimplementations
Personal short implementations of Machine Learning papers
Language: Jupyter Notebook | 23754
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook | Stars: 13.3k | Forks: 1.3k
zhenzhiwang/HumanVid
[NeurIPS D&B Track 2024] Official implementation of HumanVid
Language: Python | 2674
G-U-N/Be-Your-Outpainter
[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745
Language: Python | 2229
Stability-AI/generative-models
Generative Models by Stability AI
Language: Python | Stars: 24.9k | Forks: 2.8k
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
Stars: 2k | Forks: 28
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python | Stars: 13.4k | Forks: 1.4k
Francis-Rings/MotionEditor
[CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.
Language: Python | 1537
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language: Jupyter Notebook | Stars: 3.7k | Forks: 319
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language: Python | Stars: 6k | Forks: 517
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language: HTML | Stars: 11.3k | Forks: 952
lllyasviel/IC-Light
More relighting!
Language: Python | Stars: 7.2k | Forks: 415
harlanhong/CVPR2022-DaGAN
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
Language: Python | Stars: 976 | Forks: 126
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
Language: Python | Stars: 11k | Forks: 2.3k
yzhou359/MakeItTalk
Language: Jupyter Notebook | Stars: 981 | Forks: 220
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language: Python | Stars: 12.1k | Forks: 2.3k
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language: Python | Stars: 6.8k | Forks: 994
OI-wiki/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring dazzling arithmetic magic.)
Language: TypeScript | Stars: 21.8k | Forks: 4k
charlax/professional-programming
A collection of learning resources for curious software engineers
Language: Python | Stars: 46.9k | Forks: 3.7k
krahets/hello-algo
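"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart code. The Simplified and Traditional Chinese editions are updated in sync; English version ongoing
Language: Java | Stars: 104k | Forks: 13.1k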
nageoffer/12306
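🔥 Officially recommended 🔥 A project for university spring and autumn campus recruitment and new graduates. Built on a SpringBoot3 + Java17 + SpringCloud Alibaba + Vue3 stack, it implements a high-fidelity clone of the railway 12306 system with user, ticket-grabbing, order, and payment services, and is aimed at helping students find jobs.
Language: Java | Stars: 2.8k | Forks: 247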
testerSunshine/12306
12306 smart ticket polling and booking
Language: Python | Stars: 33.9k | Forks: 9.8k
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language: Python | Stars: 10.8k | Forks: 878
ZuodaoTech/everyone-can-use-english
Everyone can use English
Language: TypeScript | Stars: 25.5k | Forks: 3.8k
hiroi-sora/Umi-OCR
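OCR software, free, open source, and offline. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multilingual libraries.
Language: Python | Stars: 28.2k | Forks: 2.8k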
practical-tutorials/project-based-learning
Curated list of project-based tutorials
Stars: 209k | Forks: 27.3k