windwang's Stars
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
PaddlePaddle/PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Ackites/KillWxapkg
自动化反编译微信小程序,小程序安全评估工具,发现小程序安全问题,自动解密,解包,可还原工程目录,支持Hook,小程序修改
QwenLM/Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
leminlimez/Nugget
Unlock the fullest potential of your device
Standard-Intelligence/hertz-dev
first base model for full-duplex conversational audio
microsoft/data-formulator
🪄 Create rich visualizations with AI
Tencent/Tencent-Hunyuan-Large
declare-lab/tango
A family of diffusion models for text-to-audio generation.
ajay-sainy/Wav2Lip-GFPGAN
High quality Lip sync
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
modstart-lib/linkandroid
Link Android and PC easily! 全能手机连接助手!
edwko/OuteTTS
Interface for OuteTTS models.
Henry-23/VideoChat
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
tmplink/nsfw_detector
Solution for checking file if contain NSFW content.
saifhassan/Wav2Lip-HD
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
physicsexpert/Exlink_Tool
Exlink Tool是一款优雅的嵌入式多功能调试器
deltacv/PaperVision
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision using live previews as you edit.
LukeForeverYoung/UReader
bklieger-groq/gradio-groq-basics
Building Blocks for Multi-Modal Gradio Powered by Groq Apps
mowshon/lipsync
lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based on provided audio, supports CPU/CUDA, and uses caching for faster processing.
instant-high/wav2lip-onnx-HQ
Full version of wav2lip-onnx including face alignment and face enhancement and more...
Dartvauder/NeuroSandboxWebUI
(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages
bjfrbjx/stream-wav2lip
优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs
pumpkin-ws/HandEyeCalib
NeverMoreLCH/SearchLVLMs
Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge"
Yizhe-Liu/SplatPosePlus
russellrapier/Easy-Wav2lip