zhenming33's Stars
Klaus-Chow/Model-Deployment-And-Inference
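Covers mobile deployment of PyTorch models. Integrates several mainstream object detection, text detection, and text recognition algorithms, and provides a generic interface for converting torch models to ONNX, ONNX-to-ncnn model conversion, quantization for mobile models, and model inference functions.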
tw93/Pake
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 Easily build lightweight multi-platform desktop apps with Rust.
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
Kwai-Kolors/Kolors
Kolors Team
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
315386775/DeepLearing-Interview-Awesome-2024
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, along with new ideas, new problems, new resources, and new projects from work and research.
TongTong313/rectified-flow
Flow Matching (Rectified Flow) hand-built from scratch
YangLing0818/VideoTetris
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
KwaiVGI/LivePortrait
Bring portraits to life!
obsproject/obs-studio
OBS Studio - Free and open source software for live streaming and screen recording
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
liutaocode/talking_face_preprocessing
Preprocessing Scripts for Talking Face Generation
liutaocode/DiffDub
[ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder
nwojke/deep_sort
Simple Online Realtime Tracking with a Deep Association Metric
abewley/sort
Simple, online, and realtime tracking of multiple objects in a video sequence.
tcwang0509/TalkingHead-1KH
JosephPai/Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
davisking/dlib
A toolkit for making real world machine learning and data analysis applications in C++
Anduin2017/HowToCook
Programmer's guide to cooking at home (Simplified Chinese only).
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
TadasBaltrusaitis/OpenFace
OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
MRzzm/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
1adrianb/face-alignment
🔥 2D and 3D face alignment library built using PyTorch
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
numz/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111