tuqizhao's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
RVC-Boss/GPT-SoVITS
One minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
2noise/ChatTTS
A generative speech model for daily dialogue.
niklasvh/html2canvas
Screenshots with JavaScript
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
facefusion/facefusion
Industry leading face manipulation platform
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
PeterL1n/RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
XPixelGroup/BasicSR
Open-source image and video restoration toolbox for super-resolution, denoising, deblurring, etc. It currently includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. It also supports StyleGAN2 and DFDNet.
Plachtaa/VITS-fast-fine-tuning
This repo is a VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion.
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
hustvl/4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
ashawkey/torch-ngp
A PyTorch CUDA extension implementation of instant-ngp (SDF and NeRF), with a GUI.
lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
adamschwartz/chrome-tabs
Chrome-style tabs in HTML/CSS.
TencentGameMate/chinese_speech_pretrain
Chinese speech pretrained models
blackrosezy/gui-inspect-tool
GUI inspection tool for Windows
MRzzm/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
nwjs-ffmpeg-prebuilt/nwjs-ffmpeg-prebuilt
FFmpeg prebuilt binaries for NW.js / Chromium
ShenhanQian/GaussianAvatars
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
skhu101/GauHuman
Code for our CVPR'2024 paper "GauHuman: Articulated Gaussian Splatting from Monocular Human Videos"
cvlab-kaist/GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
Fictionarry/TalkingGaussian
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
NWPU66/Fundamentals-Of-Computer-Graphics-5th-CN
Study notes (Chinese translation) for the computer graphics "Tiger Book", Fundamentals of Computer Graphics, 5th edition
ashawkey/nerf_template
A simple template for practicing NeRF.
lipku/python_rtmpstream
A Python library for pushing real-time RTMP audio and video streams