fullstackpeng

眼前的一切，仿佛已跟我远离，消逝的一切，却又在化为现实。

fullstackpeng's Stars

microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Language:C++15.2k3k
TadasBaltrusaitis/OpenFace
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Language:MATLAB7k1.9k
Fictionarry/TalkingGaussian
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
Language:Python29335
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36.1k4.2k
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language:Python1.4k163
ossrs/srs
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Language:C++26.1k5.4k
AlistGo/alist
🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序，使用 Gin 和 Solidjs。
Language:Go45.3k5.9k
zzj1111/Preprocessed-CMLR-Dataset-For-Wav2Lip
Considering the original Wav2Lip was trained on LSR2 and didn't have good performance on Chinese. I preprocessed CMLR Dataset and would train Wav2Lip on CMLR. Wish it would do better in Chinese.
Language:Python607
Aruen24/wav2lip_288x288_test
Language:Python4918
mesolitica/malaya-speech
Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
Language:Jupyter Notebook24242
bmild/nerf
Code release for NeRF (Neural Radiance Fields)
Language:Jupyter Notebook10.1k1.4k
facefusion/facefusion
Industry leading face manipulation platform
Language:Python20.7k3.2k
iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
Language:Python16.7k110
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Language:Python29.1k3.6k
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Language:Python3.2k412
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.8k6.4k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python30.3k3k
XPixelGroup/BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.
Language:Python7k1.2k
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Language:Python36.1k6k
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。
Language:Python11.4k1.3k
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python7k1.3k
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python78192
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python36.6k4.5k
wooorm/franc
Natural language detection
Language:JavaScript4.2k176
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python18.1k1.4k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python73.8k8.8k
jeessy2/ddns-go
Simple and easy to use DDNS. Support Aliyun, Tencent Cloud, Dnspod, Cloudflare, Callback, Huawei Cloud, Baidu Cloud, Porkbun, GoDaddy, Namecheap, NameSilo...
Language:Go12.8k1.5k
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.7k451
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.9k349
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.6k803

fullstackpeng

fullstackpeng's Stars

microsoft/onnxruntime

TadasBaltrusaitis/OpenFace

Fictionarry/TalkingGaussian

microsoft/DeepSpeed

ZiqiaoPeng/SyncTalk

ossrs/srs

AlistGo/alist

zzj1111/Preprocessed-CMLR-Dataset-For-Wav2Lip

Aruen24/wav2lip_288x288_test

mesolitica/malaya-speech

bmild/nerf

facefusion/facefusion

iperov/DeepFaceLab

xinntao/Real-ESRGAN

TMElyralab/MuseTalk

facebookresearch/fairseq

myshell-ai/OpenVoice

XPixelGroup/BasicSR

TencentARC/GFPGAN

jianchang512/pyvideotrans

jaywalnut310/vits

idiap/coqui-ai-TTS

coqui-ai/TTS

wooorm/franc

fishaudio/fish-speech

openai/whisper

jeessy2/ddns-go

snakers4/silero-vad

FunAudioLLM/SenseVoice

modelscope/FunASR