Pinned Repositories
abd-skin-segmentation
Deep learning techniques for skin segmentation on a novel abdominal dataset. Work conducted as part of the development of an autonomous robotic ultrasound system.
AD-NeRF
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
age-estimation-pytorch
PyTorch-based CNN implementation for estimating age from face images
age-gender-estimation
Keras implementation of a CNN for age and gender estimation
ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
AOT-GAN-for-Inpainting
[TVCG] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)
apollo
An open autonomous driving platform
llama
Inference code for LLaMA models
stable-diffusion-webui
Stable Diffusion web UI
xxmiprai's Repositories
xxmiprai/llama
Inference code for LLaMA models
xxmiprai/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
xxmiprai/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
xxmiprai/AOT-GAN-for-Inpainting
[TVCG] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)
xxmiprai/backgroundremover
Background Remover lets you remove the background from images and video using AI, through a simple command-line interface that is free and open source.
xxmiprai/bitsandbytes
8-bit CUDA functions for PyTorch
xxmiprai/Bringing-Old-Photos-Back-to-Life
Bringing Old Photo Back to Life (CVPR 2020 oral)
xxmiprai/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
xxmiprai/EMO
xxmiprai/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
xxmiprai/HumanSD
[ICCV 2023] The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"
xxmiprai/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
xxmiprai/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
xxmiprai/magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
xxmiprai/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
xxmiprai/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
xxmiprai/NudeNet-Onnx-TensorRT-BatchedNMS
xxmiprai/nvidia-patch
This patch removes the restriction on the maximum number of simultaneous NVENC video encoding sessions that Nvidia imposes on consumer-grade GPUs.
xxmiprai/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
xxmiprai/OpenVoice
Instant voice cloning by MyShell.
xxmiprai/ReliableSwap
Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'
xxmiprai/RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
xxmiprai/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
xxmiprai/sd-webui-controlnet
WebUI extension for ControlNet
xxmiprai/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
xxmiprai/so-vits-svc
SoftVC VITS Singing Voice Conversion
xxmiprai/SpleeterGui
Windows desktop front end for Spleeter - AI source separation
xxmiprai/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (ggml/gguf), Llama models.
xxmiprai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
xxmiprai/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities