yangkang2021's Stars
zzj1111/Preprocessed-CMLR-Dataset-For-Wav2Lip
The original Wav2Lip was trained on LRS2 and does not perform well on Chinese, so I preprocessed the CMLR dataset and plan to train Wav2Lip on it, in the hope that it does better on Chinese.
Markfryazino/wav2lip-hq
Extension of Wav2Lip repository for processing high-quality videos.
IanMagnusson/Wav2Lip-Emotion
Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We also propose a novel automatic evaluation for emotion modification corroborated with a user study.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
githubnext/monaspace
An innovative superfamily of fonts for code
SawyerHood/draw-a-ui
Draw a mockup and generate html for it
tpulkit/txt2vid_browser
wujinzhong/Wav2Lip_TensorRT
HassanMuhammadSannaullah/Wav2lip-Fix-For-Inference
This project fixes Wav2Lip, a tool for lip-syncing videos to audio, so that it runs on Python 3.9 or higher. The original project depended on Python 3.6 and used deprecated libraries.
SkyFlap/Digital-Life-DL-B
This open-source release is DL-B, a digital avatar solution built on ChatGLM, Wav2Lip, and So-VITS. Other components can be added on top of it to achieve a "digital life" effect.
nghiakvnvsd/wav2lip_data_preprocessing
rogerle/wav2lip_train
innnky/emotional-vits
An emotion-controllable speech synthesis model that requires no emotion labels, based on VITS
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
onuralpszr/GFPGAN-ncnn-vulkan
[WIP] NCNN with Vulkan implementation of GFPGAN, which aims at developing practical algorithms for real-world face restoration
Baiyuetribe/ncnn-models
awesome AI models with NCNN, and how they were converted ✨✨✨
yanyiwu/cppjieba
The C++ version of the "Jieba" Chinese word segmentation library
NaruseMioShirakana/SoftVC-Vits-Singing-Voice-Conversion-Onnx-Export
SoftVC VITS Singing Voice Conversion: ONNX export of the VITS-based singing voice timbre conversion network
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
hiroi-sora/Umi-OCR
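Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in multilingual language packs.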
AaronFeng753/Waifu2x-Extension-GUI
Video, image and GIF upscale/enlarge (super-resolution) and video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution (VSR), SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
aselsan-research-imaging-team/bicubic-plusplus
Bicubic++ (NTIRE @ CVPR 2023, Real Time Super Resolution Track 2 winner)
Ysnower/bicubic-plusplus
An unofficial bicubic++ repo
microsoft/Web-Dev-For-Beginners
24 Lessons, 12 Weeks, Get Started as a Web Developer
ageitgey/face_recognition
The world's simplest facial recognition api for Python and the command line
antirez/smallchat
A minimal programming example for a chat server
caizhongang/SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"