windwang

windwang's Stars

All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language: Python 43.2k 329 2k 4.8k
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Language: Python 36.2k 509 4786k
PaddlePaddle/PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Language: Python 8k 107 3661.2k
Ackites/KillWxapkg
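Automated decompilation of WeChat mini programs; a mini program security assessment tool that uncovers mini program security issues, with automatic decryption and unpacking, restoration of the project directory structure, Hook support, and mini program modification.
Language: Go 4.2k 33 63891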
QwenLM/Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
Language: Python 3.8k 35 138263
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language: Python 3.3k 53 235419
leminlimez/Nugget
Unlock the fullest potential of your device
Language: Python 2k 36 304133
Standard-Intelligence/hertz-dev
first base model for full-duplex conversational audio
Language: Python 1.7k 19 26110
microsoft/data-formulator
🪄 Create rich visualizations with AI
Language: TypeScript 1.4k 16 1289
Tencent/Tencent-Hunyuan-Large
Language: Python 1.3k 25 1570
declare-lab/tango
A family of diffusion models for text-to-audio generation.
Language: Python 1.1k 28 5094
ajay-sainy/Wav2Lip-GFPGAN
High quality Lip sync
Language: Python 1.1k 15 30272
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Language: Python 981 28 38114
modstart-lib/linkandroid
Link Android and PC easily! An all-in-one phone connection assistant!
Language: TypeScript 880 7 2176
edwko/OuteTTS
Interface for OuteTTS models.
Language: Python 808 23 4365
Henry-23/VideoChat
Real-time voice-interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required, voice cloning is supported, and first-packet latency is as low as 3 s.
Language: Python 611 10 3680
tmplink/nsfw_detector
Solution for checking whether a file contains NSFW content.
Language: Python 422 3 235
saifhassan/Wav2Lip-HD
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
Language: Python 413 12 4888
physicsexpert/Exlink_Tool
Exlink Tool is an elegant multifunctional embedded debugger.
374 2 1105
deltacv/PaperVision
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision using live previews as you edit.
Language: Kotlin 346 5 113
LukeForeverYoung/UReader
Language: Python 128 3 159
bklieger-groq/gradio-groq-basics
Building Blocks for Multi-Modal Gradio Powered by Groq Apps
Language: Python 102 2 116
mowshon/lipsync
lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based on provided audio, supports CPU/CUDA, and uses caching for faster processing.
Language: Python 97 6 715
instant-high/wav2lip-onnx-HQ
Full version of wav2lip-onnx including face alignment and face enhancement and more...
Language: Python 81 6 1115
Dartvauder/NeuroSandboxWebUI
(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated into 3 languages.
Language: Python 76 8 811
bjfrbjx/stream-wav2lip
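Optimizes the wav2lip execution pipeline by splitting it into three separate steps (head/face separation, mouth-shape replacement, and background restoration), adds GFPGAN face enhancement, and implements ahead-of-time frame extraction, streaming loop processing, and OBS integration.
Language: Python 59 4 815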
pumpkin-ws/HandEyeCalib
Language: C++ 21 1 39
NeverMoreLCH/SearchLVLMs
Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge"
160
Yizhe-Liu/SplatPosePlus
Language: Python 60
russellrapier/Easy-Wav2lip
Language: Jupyter Notebook 2