yangkang2021

yangkang2021's Stars

zzj1111/Preprocessed-CMLR-Dataset-For-Wav2Lip
Considering the original Wav2Lip was trained on LSR2 and didn't have good performance on Chinese. I preprocessed CMLR Dataset and would train Wav2Lip on CMLR. Wish it would do better in Chinese.
Language:Python547
Markfryazino/wav2lip-hq
Extension of Wav2Lip repository for processing high-quality videos.
Language:Python535252
IanMagnusson/Wav2Lip-Emotion
Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We also propose a novel automatic evaluation for emotion modification corroborated with a user study.
Language:Python9417
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python68.8k8.1k
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language:Python84k6.5k
githubnext/monaspace
An innovative superfamily of fonts for code
Language:TypeScript14k232
SawyerHood/draw-a-ui
Draw a mockup and generate html for it
Language:TypeScript13.2k1.6k
tpulkit/txt2vid_browser
Language:TypeScript114
wujinzhong/Wav2Lip_TensorRT
Language:Python273
HassanMuhammadSannaullah/Wav2lip-Fix-For-Inference
This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audio. The original project was dependent on Python 3.6 and used deprecated libraries. This project fixes those problems so that Wav2Lip can now run on Python 3.9. or higher
Language:Python184
SkyFlap/Digital-Life-DL-B
本次开源为DL-B，是一个基于ChatGLM、Wav2Lip、So-VITS组建的数字形象方案。可以在此基础之上增加其他组件达到数字生命的效果。This open source is DL-B, which is a digital image scheme based on ChatGLM, Wav2Lip and So-VITS. On this basis, other components can be added to achieve the effect of digital life.
Language:Python10515
nghiakvnvsd/wav2lip_data_preprocessing
Language:Python3117
rogerle/wav2lip_train
Language:Python3010
innnky/emotional-vits
无需情感标注的情感可控语音合成模型，基于VITS
Language:Jupyter Notebook1.3k167
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language:Python12.1k999
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
Language:Python17.7k1.3k
onuralpszr/GFPGAN-ncnn-vulkan
[WIP] NCNN with Vulkan implementation of GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration
Language:C++397
Baiyuetribe/ncnn-models
awesome AI models with NCNN, and how they were converted ✨✨✨
Language:C++24929
yanyiwu/cppjieba
"结巴"中文分词的C++版本
Language:C++2.6k691
NaruseMioShirakana/SoftVC-Vits-Singing-Voice-Conversion-Onnx-Export
SoftVC Vits Singing Voice Conversion —— 基于Vits的歌声音色转换网络Onnx导出
Language:Python297
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Language:Python4.5k333
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.5k280
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
Language:Python26.1k2.6k
AaronFeng753/Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
Language:C++12.9k873
aselsan-research-imaging-team/bicubic-plusplus
Bicubic++ (NTIRE @ CVPR 2023, Real Time Super Resolution Track 2 winner)
Language:Python19117
Ysnower/bicubic-plusplus
An unofficial bicubic++ repo
Language:Python606
microsoft/Web-Dev-For-Beginners
24 Lessons, 12 Weeks, Get Started as a Web Developer
Language:JavaScript83.2k12.3k
ageitgey/face_recognition
The world's simplest facial recognition api for Python and the command line
Language:Python53k13.5k
antirez/smallchat
A minimal programming example for a chat server
Language:C7.3k809
caizhongang/SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
Language:Python97669