gxrxrdx's Stars
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
bionic-gpt/bionic-gpt
BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality
finnless/yt-summarizer
🦜️🔗📺 A langchain summarizer for YouTube videos.
metaswang/bao
Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backed
HumanAIGC/Cloth2Tex
Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On
BillFSmith/TilingZoeDepth
isl-org/ZoeDepth
Metric depth estimation from a single image
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
aimagelab/dress-code
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
miccunifi/ladi-vton
[ACM MM 2023] - LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
s0md3v/roop
one-click face swap
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
TheLastBen/fast-stable-diffusion
fast-stable-diffusion + DreamBooth
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Stability-AI/StableStudio
Community interface for generative AI
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
mingyuan-zhang/MotionDiffuse
MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
carolineec/EverybodyDanceNow
Motion Retargeting Video Subjects