XuDL's Stars
jianchang512/pyvideotrans
Translate a video from one language to another and add dubbing, with support for API calls.
flowtyone/flowty-realtime-lcm-canvas
A realtime sketch to image demo using LCM and the gradio library.
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
0xbitches/sd-webui-lcm
Latent Consistency Model for AUTOMATIC1111 Stable Diffusion WebUI
chenxwh/insanely-fast-whisper
Incredibly fast Whisper-large-v3
Samueli924/chaoxing
Fully automatic, unattended completion of task points for Chaoxing Xuexitong / Chaoxing Erya / Fanya Chaoxing.
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
upscayl/upscayl
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
openai/consistencydecoder
Consistency Distilled Diff VAE
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Project-DARC/DARC
Decentralized Autonomous Regulated Company (DARC), a company virtual machine that runs on any EVM-compatible blockchain, with on-chain law system, multi-level tokens and dividends mechanism.
SagerNet/sing-box
The universal proxy platform
hiroi-sora/Umi-OCR
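Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Ships with built-in multi-language libraries.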
hkchengrex/Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
NaruseMioShirakana/DragonianVoice
A C++ inference library for multiple SVC/TTS models.
Kahsolt/stable-diffusion-webui-prompt-travel
Travel between prompts in the latent space to make pseudo-animation, extension script for AUTOMATIC1111/stable-diffusion-webui.
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Gourieff/sd-webui-reactor-force
(DEPRECATED) Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111, SD.Next, Cagliostro) with NVIDIA GPU Support
zju3dv/4K4D
[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
emptysuns/Hi_Hysteria
Hello World! Poor performance on routes not optimized for China? Don't want to use a relay? Set up hysteria in one step.
yumingj/Text2Human
Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation
lxhao61/integrated-examples
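Examples of optimal combinations and tuned configurations for common censorship-circumvention setups built with V2Ray (v4) or Xray, Nginx or Caddy (v2), Hysteria, and others. Also provides Caddy (v2) files with specific plugins integrated. Shared for others to use and kept as a personal backup.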
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
EmbraceAGI/LifeReloaded
A life simulation game powered by GPT-4's "Advanced Data Analysis" function, offering you a second chance at life.
obsproject/obs-studio
OBS Studio - Free and open source software for live streaming and screen recording
Zz-ww/SadTalker-Video-Lip-Sync
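This project implements Wav2Lip-style video lip synthesis based on SadTalker. It generates lip shapes driven by speech from a video file, applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area, and uses DAIN, a deep-learning frame-interpolation algorithm, to fill in frames of the generated video. The interpolation smooths the transitions between synthesized lip movements so the result looks more fluid, realistic, and natural.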
nfc-tools/libnfc
Platform independent Near Field Communication (NFC) library
johnnyb/nfc-sun-decoder
A Decoder for NXP 424 DNA SUN (Secure Unique) messages