Pinned Repositories
deep_image_prior_pytorch
A quick attempt to reproduce the work Deep Image Prior for image denoising.
diffusers_stablediff_conversion
converts huggingface diffusers stablediffussion models to stablediffusion ckpt files usable in most opensource tools
ecu_datalog_analysis
Jupyter notebook containing script to read ECU data log for analysis
generative-inpainting-pytorch
A PyTorch reimplementation for paper Generative Image Inpainting with Contextual Attention (https://arxiv.org/abs/1801.07892)
idao2019_submission
Submission for International Data Analysis Olympiad (IDAO) 2019
inswapper
One-click Face Swapper and Restoration powered by insightface 🔥
knn-matting
Python implementation of KNN Matting, CVPR 2012 / TPAMI 2013 http://dingzeyu.li/projects/knn/
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
MegaDepth
Code of single-view depth prediction algorithm on Internet Photos described in "MegaDepth: Learning Single-View Depth Prediction from Internet Photos, Z. Li and N. Snavely, CVPR 2018".
Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
xiankgx's Repositories
xiankgx/inswapper
One-click Face Swapper and Restoration powered by insightface 🔥
xiankgx/Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
xiankgx/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
xiankgx/TalkingHead-1KH
xiankgx/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
xiankgx/wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
xiankgx/AIT
xiankgx/bark
🔊 Text-Prompted Generative Audio Model
xiankgx/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
xiankgx/cog-musicgen-fine-tuner
This is a cog implementation of the fine-tuner for Meta's MusicGen
xiankgx/DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
xiankgx/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
xiankgx/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
xiankgx/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
xiankgx/generative-models
Generative Models by Stability AI
xiankgx/HDTF
the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
xiankgx/ImageBind
ImageBind One Embedding Space to Bind Them All
xiankgx/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
xiankgx/LipSick
🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮
xiankgx/Moore-AnimateAnyone
xiankgx/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
xiankgx/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
xiankgx/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
xiankgx/riffusion
Stable diffusion for real-time music generation
xiankgx/SEINE
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
xiankgx/Simple-Magic-Animate
A simple magic animate pipeline including densepose inference.
xiankgx/TranSalNet
TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)
xiankgx/Unconditional-MusicGen-Trainer
fine-tuning MusicGen without prompts to generate music with a specific style
xiankgx/VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
xiankgx/Wav2Lip-GFPGAN
High quality Lip sync