zhenming33's Stars

Klaus-Chow/Model-Deployment-And-Inference
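Covers mobile deployment of PyTorch models. Integrates several mainstream object detection, text detection, and text recognition algorithms, and provides a general interface for converting torch models to ONNX, ONNX-to-ncnn conversion, quantization for mobile models, and model inference functions.
Language: C++ | Stars: 9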
tw93/Pake
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 Easily build lightweight, multi-platform desktop apps with Rust.
Language: Rust | Stars: 34k | Forks: 6k
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
Language: Python | Stars: 7.9k | Forks: 1k
Kwai-Kolors/Kolors
Kolors Team
Language: Python | Stars: 4.1k | Forks: 303
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language: Python | Stars: 1.7k | Forks: 122
315386775/DeepLearing-Interview-Awesome-2024
A collection of AIGC, CV, and LLM interview questions and answers, along with new ideas, new problems, new resources, and new projects from work and research.
Stars: 1.9k | Forks: 186
TongTong313/rectified-flow
Flow Matching (Rectified Flow) built by hand from scratch.
Language: Python 23712
YangLing0818/VideoTetris
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
Language: Python 2116
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python | Stars: 13.6k | Forks: 1.5k
obsproject/obs-studio
OBS Studio - Free and open source software for live streaming and screen recording
Language: C | Stars: 61.6k | Forks: 8.1k
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python | Stars: 9.7k | Forks: 1.3k
liutaocode/talking_face_preprocessing
Preprocessing Scripts for Talking Face Generation
Language: Python 778
liutaocode/DiffDub
[ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder
Language: Python 548
nwojke/deep_sort
Simple Online Realtime Tracking with a Deep Association Metric
Language: Python | Stars: 5.4k | Forks: 1.5k
abewley/sort
Simple, online, and realtime tracking of multiple objects in a video sequence.
Language: Python | Stars: 4k | Forks: 1.1k
tcwang0509/TalkingHead-1KH
Language: Python 13318
JosephPai/Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
Stars: 1.4k | Forks: 114
davisking/dlib
A toolkit for making real world machine learning and data analysis applications in C++
Language: C++ | Stars: 13.7k | Forks: 3.4k
Anduin2017/HowToCook
A programmer's guide to cooking at home (Simplified Chinese only).
Language: Dockerfile | Stars: 68.6k | Forks: 8.8k
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Stars: 1.8k | Forks: 89
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates technologies such as Whisper, Linly, Microsoft Speech Services, and the SadTalker talking head generation system. 🌟🔬
Language: Python | Stars: 2.2k | Forks: 373
TadasBaltrusaitis/OpenFace
OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Language: MATLAB | Stars: 7k | Forks: 1.9k
MRzzm/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Language: Python | Stars: 1k | Forks: 180
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
Language: JavaScript | Stars: 9.7k | Forks: 1.8k
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language: Python | Stars: 4.8k | Forks: 592
1adrianb/face-alignment
🔥 2D and 3D face alignment library built using PyTorch
Language: Python | Stars: 7.2k | Forks: 1.4k
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Language: Python | Stars: 1.6k | Forks: 239
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language: Jupyter Notebook | Stars: 3.8k | Forks: 322
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
Language: Python | Stars: 11.1k | Forks: 2.4k
numz/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
Language: Python | Stars: 1.3k | Forks: 177