ZhichaoZuo's Stars
lllyasviel/ControlNet
Let us control diffusion models!
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
harvardnlp/annotated-transformer
An annotated implementation of the Transformer paper.
chunhuizhang/deep_learning_notes
BrianPulfer/PapersReimplementations
Personal short implementations of Machine Learning papers
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
zhenzhiwang/HumanVid
[NeurIPS D&B Track 2024] Official implementation of HumanVid
G-U-N/Be-Your-Outpainter
[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745
Stability-AI/generative-models
Generative Models by Stability AI
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
KwaiVGI/LivePortrait
Bring portraits to life!
Francis-Rings/MotionEditor
[CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
lllyasviel/IC-Light
More relighting!
harlanhong/CVPR2022-DaGAN
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
yzhou359/MakeItTalk
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
OI-wiki/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)
charlax/professional-programming
A collection of learning resources for curious software engineers
krahets/hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
nageoffer/12306
🔥 官方推荐 🔥 大学春招、秋招、应届项目,SpringBoot3 + Java17 + SpringCloud Alibaba + Vue3 等技术架构,完成高仿铁路 12306 用户 + 抢票 + 订单 + 支付服务,帮助学生主打就业的项目。
testerSunshine/12306
12306智能刷票,订票
guoyww/AnimateDiff
Official implementation of AnimateDiff.
ZuodaoTech/everyone-can-use-english
人人都能用英语
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
practical-tutorials/project-based-learning
Curated list of project-based tutorials