gxrxrdx

gxrxrdx's Stars

svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python25.8k4.8k
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Language:Python5.3k440
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diﬀusion Models for Consistent Human Image Animation".
Language:Python1k55
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook14.3k2.1k
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
Language:Python1.4k139
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python4.7k597
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Language:Python13.5k848
bionic-gpt/bionic-gpt
BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality
Language:Rust1.9k185
finnless/yt-summarizer
🦜️🔗📺 A langchain summarizer for YouTube videos.
Language:Python7
metaswang/bao
Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backed
Language:Python12812
HumanAIGC/Cloth2Tex
Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On
Language:Python450126
BillFSmith/TilingZoeDepth
16328
isl-org/ZoeDepth
Metric depth estimation from a single image
Language:Jupyter Notebook2.3k213
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
Language:Python2.9k237
aimagelab/dress-code
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
Language:Python50562
miccunifi/ladi-vton
[ACM MM 2023] - LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
Language:Python42555
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
5.6k429
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language:Python10.4k1.1k
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python44.7k5.3k
s0md3v/roop
one-click face swap
Language:Python28.4k6.9k
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python142k26.8k
TheLastBen/fast-stable-diffusion
fast-stable-diffusion + DreamBooth
Language:Python7.5k1.3k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python55.1k5.8k
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Language:Jupyter Notebook4.4k383
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.9k2.1k
Stability-AI/StableStudio
Community interface for generative AI
Language:TypeScript8.8k875
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language:Python8.3k1.1k
mingyuan-zhang/MotionDiffuse
MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model
Language:Python85274
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python35.1k4.3k
carolineec/EverybodyDanceNow
Motion Retargeting Video Subjects
Language:Python684138