Pinned Repositories
3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
AD-NeRF
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
ai-makers-kit
GiGA Genie AI Makers Kit for Raspberry Pi
ai-tech-interview
π©βπ»π¨βπ» AI μμ§λμ΄ κΈ°μ λ©΄μ μ€ν°λ (βοΈ 1k+)
albumentations
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
albumentations_examples
Augmentations usage examples for albumentations library
anichat
awesome-talking-head-generation
bilayer-model
catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
slaustld's Repositories
slaustld/slaustld
slaustld/dlib
A toolkit for making real world machine learning and data analysis applications in C++
slaustld/ultralytics
NEW - YOLOv8 π in PyTorch > ONNX > OpenVINO > CoreML > TFLite
slaustld/vits2_pytorch
unofficial vits2-TTS implementation in pytorch
slaustld/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
slaustld/so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
slaustld/so-vits-svc
SoftVC VITS Singing Voice Conversion
slaustld/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
slaustld/IP_LAP
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
slaustld/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
slaustld/LiveSpeechPortraits
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
slaustld/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
slaustld/RVC-VITS
Few-shot multilingual tts with RVC and Vits
slaustld/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
slaustld/pytorchvideo
A deep learning library for video understanding research.
slaustld/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
slaustld/requests
A simple, yet elegant, HTTP library.
slaustld/face-alignment
:fire: 2D and 3D Face alignment library build using pytorch
slaustld/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
slaustld/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
slaustld/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
slaustld/TTS
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
slaustld/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
slaustld/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
slaustld/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
slaustld/instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
slaustld/MetaPortrait
[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
slaustld/albumentations
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
slaustld/awesome-talking-head-generation
slaustld/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code