anzw's Stars
babysor/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
pengzhile/pandora
Pandora, a ChatGPT that helps you breathe smoothly.
openai/triton
Development repository for the Triton language and compiler
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
numba/numba
NumPy aware dynamic Python compiler using LLVM
PeterL1n/RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
cupy/cupy
NumPy & SciPy for GPU
TadasBaltrusaitis/OpenFace
OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
nadermx/backgroundremover
Background Remover lets you remove the background from images and video using AI, with a simple command-line interface that is free and open source.
dpkp/kafka-python
Python client for Apache Kafka
confluentinc/confluent-kafka-python
Confluent's Kafka Python Client
PyAV-Org/PyAV
Pythonic bindings for FFmpeg's libraries.
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
filipecalegario/awesome-generative-ai
A curated list of Generative AI tools, works, models, and references
Zz-ww/SadTalker-Video-Lip-Sync
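This project implements Wav2lip-style video lip synthesis based on SadTalker. It generates lip shapes driven by speech from a video file, and offers a configurable enhancement method for the facial region that enhances the synthesized lip (face) area and improves the clarity of the generated lips. It uses DAIN, a deep-learning frame-interpolation algorithm, to interpolate frames in the generated video, filling in the motion transitions of the synthesized lips between frames so that they look smoother, more realistic, and more natural.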
CoinCheung/BiSeNet
Add bisenetv2. My implementation of BiSeNet
s9roll7/ebsynth_utility
AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.
pytorch/benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
OpenTalker/StyleHEAT
[ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation
wyhsirius/LIA
[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation
primepake/wav2lip_288x288
saifhassan/Wav2Lip-HD
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
guanjz20/StyleSync
Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
OpenVisualCloud/SVT-VP9
SVT VP9 encoder. Scalable Video Technology (SVT) is a software-based video coding technology that is highly optimized for Intel® Xeon® processors. Using the open source SVT-VP9 encoder, it is possible to spread video encoding processing across multiple Intel® Xeon® processors to achieve a real advantage of processing efficiency.
salinaaaaaa/NVIDIA-GPU-Tensor-Core-Accelerator-PyTorch-OpenCV
Computer vision container that includes Jupyter notebooks with built-in code hinting, Anaconda, CUDA 11.8, TensorRT inference accelerator for Tensor cores, CuPy (GPU drop-in replacement for NumPy), PyTorch, PyTorch Geometric for Graph Neural Networks, TF2, TensorBoard, and OpenCV for accelerated workloads on NVIDIA Tensor cores and GPUs.
xuanandsix/GFPGAN-onnxruntime-demo
This is the onnxruntime inference code for GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior (CVPR 2021). Official code: https://github.com/TencentARC/GFPGAN
cudawarped/opencv-python-cuda-wheels
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
bychen7/Face-Restoration-TensorRT
A simple face restoration TensorRT deployment solution.
triton-inference-server/tensorrt_backend
The Triton backend for TensorRT.
qbxlvnf11/convert-pytorch-onnx-tensorrt
Converting weights of PyTorch models to ONNX & TensorRT engines